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PREFACE 


These unforeseen stoppages, 
which I own I had no conception of when I first set out; 
but which, I am convinced now, will rather increase than diminish as I advance, 

—— have struck out a hint which I am resolved to follow; 

—— and that is， —— not to be in a hurry; 
but to go on leisurely, writing and publishing two volumes of my life every year; 
—— which, if I am suffered to go on quietly, and can make a tolerable bargain 

with my bookseller, I shall continue to do as long as I live. 

— LAURENCE STERNE, The Life and Opinions of 

Tristram Shandy, Gentleman (1759) 


This booklet contains draft material that I’m circulating to experts in the 
field, in hopes that they can help remove its most egregious errors before too 
many other people see it. I am also, however, posting it on the Internet for 
courageous and/or random readers who don’t mind the risk of reading a few 
pages that have not yet reached a very mature state. Beware: This material 
has not yet been proofread as thoroughly as the manuscripts of Volumes 1,2, 
3, and 4A were at the time of their first printings. And those carefully-checked 
volumes ， alas，were subsequently found to contain thousands of mistakes. 

Given this caveat, I hope that my errors this time will not be so numerous 
and/or obtrusive that you will be discouraged from reading the material carefully. 
I did try to make the text both interesting and authoritative, as far as it goes. 
But the field is vast; I cannot hope to have surrounded it enough to corral it 
completely. So I beg you to let me know about any deficiencies that you discover. 

To put the material in context, this long pre-fascicle contains Section 7.2.2.2 
of a long，long chapter on combinatorial algorithms. Chapter 7 will eventually 
fill at least four volumes (namely Volumes 4A ， 4B ， 4C，and 4D)，assuming that 
Pm able to remain healthy. It began in Volume 4A with a short review of graph 
theory and a longer discussion of “Zeros and Ones” (Section 7.1); that volume 
concluded with Section 7.2.1, “Generating Basic Combinatorial Patterns，” which 
was the first part of Section 7.2, “Generating All Possibilities •” Volume 4B will 
begin with Section 7.2.2, about backtracking in general, and Section 7.2.2.1 will 
discuss a family of methods called “dancing links，” for updating data structures 
while backtracking. That sets the scene for the present section，which applies 
those ideas to the important problem of Boolean satisfiability. 
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PREFACE 


氺氺氺 

After working on this material for more than three years, I’ve finally gotten most 
of the major topics into place. Each time I finish drafting a small piece of the 
final big picture ， I’ve been testing it by adding it to these pages; hence there now 
are quite a few scraps of text and exercises，which I plan to refine and polish (if 
the FORCE stays with me). 

I hope you will see why I’ve found this topic to be such a fascinating story; 
and I hope you’ll not get too lost as I move through different parts of the tale. 
I’ve tried to explain things in a natural order. 


氺氺氺 

My notes on combinatorial algorithms have been accumulating for more than 
fifty years, yet I fear that in many respects my knowledge is woefully behind the 
times. Please look，for example, at the exercises that I’ve classed as research 
problems (rated with difficulty level 46 or higher), namely exercises 5, 39 ， 112, 
193 ， 194 ， 236 ， 283 ， 516，…； exercise 223 is also currently unsolved，although 
I’ve rated it only c 40’ because I once thought of an answer (which I have since 
forgotten!). IVe also implicitly mentioned or posed additional unsolved questions 
in the answers to exercises 18, 19, 68 , 84, 105(c,e), 111, 132, 183, 194, 204, 205, 
316, 335, 351, 360, 365, 372, 397, 409(c), 476, 480, 486, 487, 488, 501, 511, 515, 

•… And I still haven’t solved exercise 68 . Are those problems still open? Please 
inform me if you know of a solution to any of those intriguing enigmas. And of 
course if no solution is known today but you do make progress on any of them 
in the future, I hope you’ll let me know. 

I urgently need your help also with respect to some exercises that I made 
up as I was preparing this material. I certainly don’t like to receive credit for 
things that have already been published by others, and most of these results are 
quite natural “fruits” that were just waiting to be “plucked •” Therefore please 
tell me if you know who deserves to be credited, with respect to the ideas found 
in exercises 18, 19, 20, 21, 22, 24, 29, 38(b), 62, 63, 65(b), 74, 84(c,d,e), 101, 108, 
132, 133, 149, 151, 161, 162, 177, 180, 181, 188, 191, 204(b,c,d), 206, 207, 208, 
220, 228, 229, 232, 239, 242, 252, 259, 270, 272, 273, 279, 280, 282, 300, 305, 
310, 311, 312, 327, 328, 329, 334, 335, 336(b), 337, 343, 349, 357, 358, 361, 390, 
396, 399(c), 404, 406, 410, 411, 414, 419, 423, 427, 432, 433, 435, 439, 462, 463, 
464, 465, 470, 472, 473, 475(d,e,f,g), 476, 479, 495, 498, , and/or the answers 

to exercises •… Furthermore I’ve credited exercise 170 to unpublished work of 
Heule. Have any of those results ever appeared in print, to your knowledge? 

I also wonder if Eq. 7.2.2.2—( 169 ) is “well known •” 
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PREFACE 


V 


Special thanks are due to Armin Biere, Randy Bryant, Sam Buss, Niklas Een ， 
Ian Gent，Marijn Heule，Holger Hoos，Svante Janson，Peter Jeavons，Daniel 
Kroening，Oliver Kullmann，Massimo Lauria，Wes Pegden，Will Shortz，Cars ten 
Sinz, Niklas Sorensson, Udo Wermuth, Ryan Williams, and … for their detailed 
comments on my early attempts at exposition，as well as to numerous other cor¬ 
respondents who have contributed crucial corrections. Thanks also to Stanford’s 
Information Systems Laboratory for providing extra computer power when my 
laptop machine was inadequate. 


氺氺氺 

Wow — Section 7.2.2.2 has turned out to be the longest section, by far, in 
The Art of Computer Programming. The SAT problem is evidently a “killer 
app，” because it is key to the solution of so many other problems. Consequently 
I can only hope that my lengthy treatment does not also kill off my faithful 
readers! As I wrote this material, one topic always seemed to flow naturally 
into another 5 so there was no neat way to break this section up into separate 
subsections. (And anyway the format of TAOCP doesn’t allow for a Section 
7.2.2.2.1.) 

I’ve tried to ameliorate the reader’s navigation problem by adding subhead¬ 
ings at the top of each right-hand page. Furthermore, as in other sections ， 
the exercises appear in an order that roughly parallels the order in which corre¬ 
sponding topics are taken up in the text. Numerous cross-references are provided 
between text, exercises, and illustrations，so that you have a fairly good chance of 
keeping in sync. I’ve also tried to make the index as comprehensive as possible. 

I wrote more than three hundred computer programs while preparing this 
material, because I find that I don’t understand things unless I try to program 
them. Most of those programs were quite short, of course; but several of them 
are rather substantial, and possibly of interest to others. Therefore I’ve made a 
selection available by listing some of them on the following webpage: 

http : //www-cs-facuity•Stanford•edu/^knuth/programs•html 

I happily offer a “finder’s fee” of $2.56 for each error in this draft when it is 
first reported to me，whether that error be typographical, technical, or historical. 
The same reward holds for items that I forgot to put in the index. And valuable 
suggestions for improvements to the text are worth 32^f each. (Furthermore, if 
you find a better solution to an exercise, I’ll actually do my best to give you 
immortal glory, by publishing your name in the eventual book:—) 

Volume 4B will begin with a special tutorial and review of probability 
theory，in an unnumbered section entitled “Mathematical Preliminaries Redux •” 
References to its equations and exercises use the abbreviation C MPR’. (Think of 
the word “improvement •”) A preliminary version of that section can be found 
online, as pre-fascicle 5a to Volume 4, if you knew how to find this one. 

Cross references to yet-unwritten material sometimes appear as c 00 5 ; this 
impossible value is a placeholder for the actual numbers to be supplied later. 
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Vi PREFACE 

The notational conventions that I’ve used in the mathematical formulas notationai conventions 

of this section are summarized either in the Index to Notations of Volume 4A Knuth 

(Appendix B on pages 822-827) or under the heading c Notational conventions’ 
in the index below. 

Happy reading! 

Stanford，California D. E. K. 

69 Umbruary 2015 
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Here’s an exercise for Section 7.2.2.1 that I plan to put eventually into fascicle 5: 

00. [20] The problem of Langford pairs on {1 ， 1 ， • • • ， n ， n} can be represented as an 
exact cover problem using columns {di^... ^d n }U {si, • • • ， S 2 n}] the rows are di Sj Sk for 
1 < i < n and 1 < j < k < 2n and k = i+j + 1，meaning “put digit i into slots j and k •” 
However, that construction essentially gives us every solution twice, because the 
left-right reversal of any solution is also a solution. Modify it so that we get only half 
as many solutions; the others will be the reversals of these- 


Langford pairs 
exact cover problem 
symmetry breaking 
DEFOE 
Crusoe 


And here’s its cryptic answer (needed in exercise 7.2.2.2-13): 

00. Omit the rows with i = n — [n even] and j > n/2. 

(Other solutions are possible. For example, we could omit the rows with i = 1 and 
j > n; that would omit n — 1 rows instead of only |_n/2」. However, the suggested rule 
turns out to make the dancing links algorithm run about 10% faster.) 


Now I saw, tho’ too late，the Folly of 
beginning a Work before we count the Cost, 
and before we judge rightly of our own Strength to go through with it. 

—— DANIEL DEFOE, Robinson Crusoe (1719) 
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7.2.2.2 


SATISFIABILITY 


1 


He reaps no satisfaction but from low and sensual objects, 

or from the indulgence of malignant passions. 

— DAVID HUME, The Sceptic (1742) 


HUME 

JAGGER 

RICHARDS 

Boolean formula 

unsatisfiable 

satisfying assignment 

consistent, see satisfiable 

inconsistent, see unsatisfiable 

P = NP 

NP-complete problem 

Knuth 

SAT solvers 


I can't get no … 

MICK JAGGER and KEITH RICHARDS, Satisfaction (1965) 


7.2.2.2. Satisfiability. We turn now to one of the most fundamental problems 
of computer science: Given a Boolean formula F(x\^ ... ， x n ), expressed in so- 
called u conjunctive normal form” as an AND of ORs，can we “satisfy” F by 
assigning values to its variables in such a way that F(x\^... ^x n ) = 1? For 
example, the formula 

F{xi,x 2 ,x^) = (xi V x 2 ) A (x 2 V x 3 ) A (^1 V x 3 ) A (xi V V x 3 ) ( 1 ) 

is satisfied when X 1 X 2 XS = 001. But if we rule that solution out, by defining 

G(x 1 ,x 2 ,x 3 ) = F(x 1 ,x 2 ,xs) a (xi V $2 V X 3 ), (2) 

then G is unsatisfiable: It has no satisfying assignment. 

Section 7.1.1 discussed the embarrassing fact that nobody has ever been 
able to come up with an efficient algorithm to solve the general satisfiability 
problem, in the sense that the satisfiability of any given formula of size N could be 
decided in N 0 、 1 、 steps. Indeed，the famous unsolved question “does P = NP?” 
is equivalent to asking whether such an algorithm exists. We will see in Section 
7.9 that satisfiability is a natural progenitor of every NP-complete problem.* 
On the other hand enormous technical breakthroughs in recent years have 
led to amazingly good ways to approach the satisfiability problem. We now 
have algorithms that are much more efficient than anyone had dared to believe 
possible before the year 2000. These so-called “SAT solvers” are able to handle 
industrial-strength problems, involving millions of variables, with relative ease, 
and they’ve had a profound impact on many areas of research such as computer- 
aided verification. In this section we shall study the principles that underlie 
modern SAT-solving procedures. 

* At the present time very few people believe that P = NP [see SIGACT News 43, 2 (June 
2012), 53-77]. In other words, almost everybody who has studied the subject thinks that 
satisfiability cannot be decided in polynomial time. The author of this book, however, suspects 
that TV^^O-step algorithms do exist, yet that they’re unknowable. Almost all polynomial time 
algorithms are so complicated that they lie beyond human comprehension, and could never be 
programmed for an actual computer in the real world- Existence is different from embodiment- 
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COMBINATORIAL ALGORITHMS (F6A: 23 Sep 2015 @ 1959) 


7 . 2 . 2.2 


To begin ， let’s define the problem carefully and simplify the notation, so 
that our discussion will be as efficient as the algorithms that we’ll be considering. 
Throughout this section we shall deal with variables, which are elements of any 
convenient set. Variables are often denoted by X 2 ^ 冗 3 , • •. ， as in (i); but any 
other symbols can also be used，like a ， 6 , c，or even d^ A . We will in fact often use 
the numerals 1 ， 2, 3，… to stand for variables; and in many cases well find it 
convenient to write just j instead of Xj, because it takes less time and less space 
if we don’t have to write so many x^s. Thus ‘2’ and L X 2 will mean the same 
thing in many of the discussions below. 

A literal is either a variable or the complement of a variable. In other words ， 
if r is a variable, both v and v are literals. If there are n possible variables in 
some problem，there are 2n possible literals. If l is the literal 元 2 , which is also 
written 2 , then the complement of /，『， is X 2 , which is also written 2 . 

The variable that corresponds to a literal l is denoted by |/|; thus we have 
v\ = \v\ = v for every variable v. Sometimes we write 士 r for a literal that is 
either v or v. We might also denote such a literal by av^ where a is ±1. The 
literal l is called positive if |/| = /; otherwise |/| = 『， and l is said to be negative. 

Two literals l and V are distinct if l / They are strictly distinct if \l\ 妾 \l f \. 
A set of literals {“， …， 4} is strictly distinct if |/^| / |/j| for 1 < i < j < fc. 

The satisfiability problem，like all good problems, can be understood in many 
equivalent ways，and we will find it convenient to switch from one viewpoint to 
another as we deal with different aspects of the problem. Example ( 1 ) is an AND 
of clauses, where every clause is an OR of literals; but we might as well regard 
every clause as simply a set of literals，and a formula as a set of clauses. With 
that simplification，and with identical to c j\ Eq. ( 1 ) becomes 

F = {{1,2},{2,3},{1,3},{1,2,3}}. 

And we needn’t bother to represent the clauses with braces and commas either; 
we can simply write out the literals of each clause. With that shorthand we’re 
able to perceive the real essence of ( 1 ) and ( 2 ): 

F = {12,23,13,123}, G = FU{123}. ( 3 ) 

Here F is a set of four clauses, and G is a set of five. 

In this guise, the satisfiability problem is equivalent to a covering problem, 
analogous to the exact cover problems that we considered in Section 7.2.2.1: Let 

Tn 二 = {ll, 22 ,... ,nn}. ( 4 ) 

“Given a set F = {C\^... ^C m }^ where each Ci is a clause and each clause 
consists of literals based on the variables {$i ， …• ， : r n }，find a set L of n literals 
that ^covers 5 F U T n , in the sense that every clause contains at least one element 
of L.” For example，the set F in ( 3 ) is covered by L = {I ， 2, 3}，and so is the set 
T 3 ; hence F is satisfiable. The set G is covered by {1 ， I ， 2} or {1 ， I ， 3} or … or 
{2, 3, 3}，but not by any three literals that also cover T 3 ; so G is unsatisfiable. 

Similarly，a family F of clauses is satisfiable if and only if it can be covered 
by a set L of strictly distinct literals. 


variables 

literal 

notation: V 
positive 
negative 
distinct 

strictly distinct 
covering problem 
exact cover problems 
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7.2.2.2 


SATISFIABILITY 


3 


If F 1 is any formula obtained from F by complementing one or more vari¬ 
ables, it’s clear that F 1 is satisfiable if and only if F is satisfiable. For example ， 
if we replace 1 by I and 2 by 2 in ( 3 ) we obtain 

F' = {12,23,13,123}, G = F U {123}. 

In this case F 1 is trivially satisfiable, because each of its clauses contains a 
positive literal: Every such formula is satisfied by simply letting L be the set of 
positive literals. Thus the satisfiability problem is the same as the problem of 
switching signs (or “polarities”) so that no all-negative clauses remain. 

Another problem equivalent to satisfiability is obtained by going back to the 
Boolean interpretation in ( 1 ) and complementing both sides of the equation. By 
De Morgan’s laws 7.1.1—( 11 ) and ( 12 ) we have 

F(x 1 ,x 2 ,x 3 ) = (^1 Ax 2 ) V (x 2 Axs) V (xi Ax 3 ) V (xi A x 2 A 无 3 ); ( 5 ) 

and F is unsatisfiable 4=^ F = 0 F = 1 F is a tautology. Consequently 
F is satisfiable if and only if F is not a tautology: The tautology problem and 
the satisfiability problem are essentially the same.* 

Since the satisfiability problem is so important, we simply call it SAT. And 
instances of the problem such as (i), in which there are no clauses of length 
greater than 3， are called 3SAT. In general, A:SAT is the satisfiability problem 
restricted to instances where no clause has more than k literals. 

Clauses of length 1 are called unit clauses^ or unary clauses. Binary clauses, 
similarly, have length 2 ; then come ternary clauses，quaternary clauses，and so 
forth. Going the other way，the empty clause^ or null ary clause, has length 0 and 
is denoted by e; it is always unsatisfiable. Short clauses are very important in al¬ 
gorithms for SAT, because they are easier to deal with than long clauses. But long 
clauses aren’t necessarily bad; they’re much easier to satisfy than the short ones. 

A slight technicality arises when we consider clause length: The binary 
clause (xi V X 2 ) in ( 1 ) is equivalent to the ternary clause (xi W xi W X 2 ) as well 
as to (xi V X 2 V X 2 ) and to longer clauses such as (xi W x\W x\W X 2 )] so we can 
regard it as a clause of any length > 2. But when we think of clauses as sets 
of literals rather than ORs of literals, we usually rule out multisets such as 112 
or 122 that aren’t sets; in that sense a binary clause is not a special case of a 
ternary clause. On the other hand，every binary clause (x V y) is equivalent to 
two ternary clauses ， (x \/ y \/ z) A (x W y W z)/if z is another variable; and every 
A:-ary clause is equivalent to two (k + l)-ary clauses. Therefore we can assume ， 
if we like, that A：SAT deals only with clauses whose length is exactly k. 

A clause is tautological (always satisfied) if it contains both v and v for some 
variable v. Tautological clauses can be denoted by p (see exercise 7.1.4-222). 
They never affect a satisfiability problem; so we usually assume that the clauses 
input to a SAT-solving algorithm consist of strictly distinct literals. 

When we discussed the 3SAT problem briefly in Section 7.1.1，we took a look 
at formula 7.1.1—( 32 )，“the shortest interesting formula in 3CNF.” In our new 

* Strictly speaking，TAUT is coNP-complete, while SAT is NP-complete; see Section 7.9. 


trivially SAT 
polarities 
De Morgan’s laws 
TAUT 

coNP-complete 

SAT 

kSAT 

3SAT 

unit clauses 
unary clauses 
Binary clauses 
ternary clauses 
empty clause 
nullary clause 
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4 COMBINATORIAL ALGORITHMS (F6A: 23 Sep 2015 @1959) 7 . 2 . 2.2 

shorthand, it consists of the following eight unsatisfiable clauses: 

R = {123,234,341,412,123,234,341,412}. ( 6 ) 

This set makes an excellent little test case, so we will refer to it frequently below. 
(The letter R reminds us that it is based on R. L. Rivest’s associative block design 
6 . 5 -( 13 ).) The first seven clauses of namely 

R' - {123,234,341,412,123,234,34l}, ( 7 ) 

also make nice test data; they are satisfied only by choosing the complements of 
the literals in the omitted clause，namely {4 ， I ， 2}. More precisely，the literals 
4, I， and 2 are necessary and sufficient to cover i ?’； we can also include either 3 
or 3 in the solution. Notice that ( 6 ) is symmetric under the cyclic permutation 
l-^2^3—)^4—> 1 —> 2 —)*3-^4-^lof literals; thus, omitting any clause 
of ( 6 ) gives a satisfiability problem equivalent to ( 7 ). 

A simple example. SAT solvers are important because an enormous variety 
of problems can readily be formulated Booleanwise as ANDs of ORs. Let’s begin 
with a little puzzle that leads to an instructive family of example problems: 
Find a binary sequence x\ .. .xs that has no three equally spaced Os and no 
three equally spaced Is. For example，the sequence 01001011 almost works; but 
it doesn’t qualify，because x‘ 2 , $ 5 , and xg are equally spaced Is. 

If we try to solve this puzzle by backtracking manually through all 8 -bit 
sequences in lexicographic order，we see that X 1 X 2 = 00 forces X 3 = 1. Then 

= 0010011 leaves us with no choice for xs- A minute or two of 
further hand calculation reveals that the puzzle has just six solutions，namely 

00110011 , 01011010 , 01100110 , 10011001 , 10100101 , 11001100 . ( 8 ) 

Furthermore it’s easy to see that none of these solutions can be extended to a 
suitable binary sequence of length 9. We conclude that every binary sequence 
xi .. .xq contains three equally spaced 0 s or three equally spaced Is. 

Notice now that the condition / 111 is the same as the Boolean 

clause (x 2 V ^5 V xg), namely 258. Similarly X 2 X^xs / 000 is the same as 258. 
So we have just verified that the following 32 clauses are unsatisfiable: 

123,234,, 789,135,246,, 579,147,258,369,159, 

123,234,, 789,135,246,, 579,147,258,369,159. ( 9 ) 

This result is a special case of a general fact that holds for any given positive 
integers j and k: If n is sufficiently large, every binary sequence x\ .. .x n contains 
either j equally spaced 0s or k equally spaced Is. The smallest such n is denoted 
by Wk) in honor of B. L. van der Waerden, who proved an even more general 
result (see exercise 2.3.4.3-6): If n is sufficiently large，and if ko, … ， kb_i are 
positive integers, every b-ary sequence x\ .. .x n contains k a equally spaced a，s 
for some digit 0 < a < b. The least such n is • • • ， 

Let us accordingly define the following set of clauses when jf ， k，n > 0: 

waerden(j,k;n) = {(^ V x i+d V ... V x i+(J •一 *) j l<i <n - (j - V) d, d > 1} 

U {(而 V x^d V ... V x i+ ^ k _ 1)d ) I 1 < i < n - d > l}. ( 10 ) 


Rivest 

associative block design 
equally spaced occurrences 
arithmetic progressions 
backtracking 
lexicographic order 
van der Waerden 

W(ko ,.. 
waerden^ 


September 23, 2015 







7 . 2 . 2.2 SATISFIABILITY: EXAMPLE APPLICATIONS 5 

The 32 clauses in ( 9 ) are waerden(3^ 3; 9); and in general waerden(j^k]n) is an 
appealing instance of SAT ， satisfiable if and only if n < W(j^ k). 

It’s obvious that W(l^ k) = k and W(2^ k) = 2k — [A:even]; but when j and k 
exceed 2 the numbers W(j^ k) are quite mysterious. We’ve seen that W (3,3) = 9, 
and the following nontrivial values are currently known: 

二 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 

W(3, k)= 9 18 22 32 46 58 77 97 114 135 160 186 218 238 279 312 349 

W(4, k) - 18 35 55 73 109 146 309 ????????? ? 

W(5, k) = 22 55 178 206 260 ???? ? ?????? ? 

W"( 6 , fc) 二 32 73 206 1132 ????????????? 

V. Chvatal inaugurated the study k) by computing the values for j+k < 9 

as well as VF(3, 7) [Combinatorial Structures and Their Applications (1970) ， 31— 
33]. Most of the large values in this table have been calculated by state-of-the-art 
SAT solvers [see M. Kouril and J. L. Paul, Experimental Math. 17 (2008), 53- 
61; M. Kouril, Integers 12 (2012), A46:l-A46:13]. The table entries for j = 3 
suggest that we might have W(3^ k) < k 2 when k > 4, but that isn’t true: SAT 
solvers have also been used to establish the lower bounds 

k = 20 21 22 23 24 25 26 27 28 29 30 
W(3,k)> 389 416 464 516 593 656 727 770 827 868 903 

(which might in fact be the true values for this range of k)] see T. Ahmed, 
O. Kullmann, and H. Snevily [Discrete Applied Math. 174 (2014), 27-51]. 

Notice that the literals in every clause of waerden(j^ k; n) have the same 
sign: They’re either all positive or all negative. Does this “monotonic” property 
make the SAT problem any easier? Unfortunately, no: Exercise 10 proves that 
any set of clauses can be converted to an equivalent set of monotonic clauses. 

Exact covering. The exact cover problems that we solved with “dancing links” 
in Section 7.2.2.1 can easily be reformulated as instances of SAT and handed off 
to SAT solvers. For example, let’s look again at Langford pairs，the task of 
placing two Is, two 2 s, •…， two n’s into 2 n slots so that exactly k slots intervene 
between the two appearances of for each k. The corresponding exact cover 
problem when n = 3 has nine columns and eight rows (see 7.2.2.1-(oo)): 

d\ Si S 3 , d\ S 2 S 4 , d\ S 3 55 , d\ S 4 sq , d。s 1 s 4 ， ^2 ^2 ^ 5 1 ^2 ^3 ^ 6 1 ^3 ^1 ^5 • ( 11 ) 

The columns are ^ for 1 < i < 3 and Sj for 1 < j < 6 ; the row c di Sj Sk means 
that digit i is placed in slots j and k. Left-right symmetry allows us to omit the 
row c ds S 2 Sq from this specification. 

We want to select rows of ( 11 ) so that each column appears just once. Let 
the Boolean variable Xj mean c select row j\ for 1 < j < 8 ; the problem is then 
to satisfy the nine constraints 

S 1 (x 1 ,X 2 ,xs,x 4 ) A S 1 (x 5 ,x 6 ,x 7 ) A (^ 8 ) 

A 51(^1^5^ 8 ) A 6^02 ， $6) A (X!,XS,X 7 ) 

A Si(x 2 ,x 4 ,x 5 ) A Si(x 3 ,x 6 ,xs) A Sx(x 4 ,x 7 ), ( 12 ) 


Chvatal 

Kouril 

Paul 

Kouril 

Ahmed 

Kullmann 

Snevily 

monotonic clauses 
exact cover problems 
dancing links 
Langford pairs 
symmetry 
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one for each column. (Here, as usual, • • ”y p ) denotes the symmetric 

function [yi +•••+% = ]_]•) For example，we must have : r 5 + 仰 + 巧 = 1 ， 
because column 办 appears in rows 5, 6 , and 7 of (n). 

One of the simplest ways to express the symmetric Boolean function Si as 
an AND of ORs is to use 1 + (? 2 ) clauses: 

Siiyi ， ... ， yp) = {yiH y P ) f\ (yj v y k ). ( 13 ) 

l<j<k<p 

“At least one of the y^s is true，but not two •” Then ( 12 ) becomes，in shorthand 5 

{1234,12,13,14,23,24,34,567,56,57,67,8, 

158,15,18,58,26,26,137,13,17,37, 

245,24,25,45,368,36,38,68,47,47}; ( 14 ) 

we shall call these clauses langford(3). (Notice that only 30 of them are actually 
distinct，because 13 and 24 appear twice.) Exercise 13 defines langford(n)' we 
know from exercise 7-1 that langford(n) is satisfiable nmod4 = 0 or 3. 

The unary clause 8 in ( 14 ) tells us immediately that xg = 1. Then from the 
binary clauses IS ， 5 §， 38, 68 we have xi = = xs = xq = 0. The ternary clause 

137 then implies X 7 = 1; finally X 4 = 0 (from 47) and X 2 = 1 (from 1234). Rows 
8 , 7, and 2 of ( 11 ) now give us the desired Langford pairing 312132. 
Incidentally, the function Si (yi ， 2 / 2 , 2 / 3 , 2 / 4 , 2 / 5 ) can also be expressed as 

(yi v 2/2 v 2/3 v 2/4 v y h ) A (yi V y 2 ) A (yi V y 3 ) A (yi V Q 

A(y 2 Vy 3 ) A(y 2 Vt) A(y 3 Vt) A(tVy 4 ) A(tVy 5 ) A(y 4 Vy 5 ), 

where t is a new variable. In general, if p gets big, it’s possible to express 
S 认 yi, … ， Up) with only 3p — 5 clauses instead of ⑸ + 1， by using [(p — S) / 2 J new 
variables as explained in exercise 12. When this alternative encoding is used to 
represent Langford pairs of order n ， we’ll call the resulting clauses langford 1 (n). 

Do SAT solvers do a better job with the clauses langford (n) or langford 1 {n)l 
Stay tuned: We’ll find out later. 

Coloring a graph. The classical problem of coloring a graph with at most d 
colors is another rich source of benchmark examples for SAT solvers. If the graph 
has n vertices we can introduce nd variables Vj, for v ^ V and 1 < j < 
signifying that v has color j ; the resulting clauses are quite simple: 

(仍 V V … V Vd) ior v (“every vertex has at least one color”）; ( 15 ) 
(uj V Vj) for u —— I < j < d (“adjacent vertices have different colors”）. ( 16 ) 

We could also add n(^) additional so-called exclusion clauses 

(vi V Vj) for v ^ I <i < j <d (“every vertex has at most one color”）; ( 17 ) 

but they’re optional, because vertices with more than one color are harmless. 
Indeed，if we find a solution with v\ = V 2 = 1, well be extra happy, because it 
gives us two legal ways to color vertex v. (See exercise 14.) 


^1 {yi ，•••，％) 

symmetric Boolean function 

langford (n) 

unary clause 
binary clauses 
ternary clause 
encoding 
Langford pairs 

langford 1 (n) 

coloring a graph 
exclusion clauses 
at-most-one 
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Fig. 33. The McGregor graph 
of order 10- Each region of this 
“map” is identified by a two- 
digit hexadecimal code. Can you 
color the regions with four colors ， 
never using the same color for 
two adjacent regions? 


00 
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02 

03 

04 

05 

06 

07 

08 


11 

12 

13 

14 

15 

16 

17 

18 

1 



22 

23 

24 

25 

26 

27 

28 

29 




33 

34 

35 

36 

37 

38 

39 






44 

45 

46 

47 

48 

49 








55 

56 

57 

58 

59 










66 

67 

68 

69 












77 

78 

79 














88 

89 
















QQ 









09 


20 


30 


40 


50 


60 


70 


80 


90 


aO 


21 


31 


41 


51 


71 


91 
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61 

62 

63 

64 


74 


94 


65 


75 


95 


76 


81 

82 

83 

84 

85 

86 


96 


87 


97 


98 


al 

a2 

a3 

a4 

a5 

a6 

a7 

a8 


a9 


10 


Gardner 

chess 

da Vinci 

RipofF 

April Fool 

Four Color Theorem 

McGregor 

Bryant 

independent 


Martin Gardner astonished the world in 1975 when he reported [Scientific 
American 232, 4 (April 1975) ， 126-130] that a proper coloring of the planar 
map in Fig. 33 requires five distinct colors，thereby disproving the longstanding 
four-color conjecture. (In that same column he also cited several other “facts” 
supposedly discovered in 1974: (i) is an integer; (ii) pawn-to - king-rook-4 

( c h4’）is a winning first move in chess; (iii) the theory of special relativity is 
fatally flawed; (iv) Leonardo da Vinci invented the flush toilet; and (v) Robert 
RipofF devised a motor that is powered entirely by psychic energy. Thousands 
of readers failed to notice that they had been April Fooled!) 

The map in Fig. 33 actually can be 4 - colored; you are hereby challenged to 
discover a suitable way to do this，before turning to the answer of exercise 18. 
Indeed，the four - color conjecture became the Four Color Theorem in 1976， as 
mentioned in Section 7. Fortunately that result was still unknown in April of 
1975; otherwise this interesting graph would probably never have appeared in 
print. McGregor’s graph has 110 vertices (regions) and 324 edges (adjacencies 
between regions); hence ( 15 ) and ( 16 ) yield 110 + 1296 = 1406 clauses on 440 
variables，which a modern SAT solver can polish off quickly. 

We can also go much further and solve problems that would be extremely 
difficult by hand. For example, we can add constraints to limit the number of 
regions that receive a particular color. Randal Bryant exploited this idea in 2010 
to discover that there’s a four-coloring of Fig. 33 that uses one of the colors only 
7 times (see exercise 17). His coloring is，in fact, unique, and it leads to an 
explicit way to 4 - color the McGregor graphs of all orders n > 3 (exercise 18). 

Such additional constraints can be generated in many ways. We could ， 
for instance，append (^ 0 ) clauses，one for every choice of 8 regions, specifying 
that those 8 regions aren’t all colored 1. But no, we’d better scratch that idea: 

= 409,705,619,895. Even if we restricted ourselves to the 74,792,876,790 

sets of 8 regions that are independent, we’d be dealing with far too many clauses. 
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An interesting SAT - oriented way to ensure that x\ +- h x n is at most r ， 

which works well when n and r are rather large，was found by C. Sinz [LNCS 
3709 (2005), 827-831]. His method introduces (n — r)r new variables Sj for 

1 < j < n — r and 1 < k < r. If F is any satisfiability problem and if we add the 
(n — r — l)r + (n — r) (r + 1 ) clauses 


(芍 v 心)， 

(xj^Ws^Ws^ 1 ) 


for 1 < j < n — r and 1 < A: < r, 
for 1 < jf < n — r and Q < k < r ， 


( 18 ) 

09) 


where Sj is omitted when A: = 0 and s ^ +1 is omitted when k = r, then the new set 
of clauses is satisfiable if and only if F is satisfiable with xiH - \-x n < r. (See ex¬ 

ercise 26.) With this scheme we can limit the number of red-colored regions of 
McGregor’s graph to at most 7 by appending 1538 clauses in 721 new variables. 
Another way to achieve the same goal, which turns out to be even better, 

has been proposed by O. Bailleux and Y. Boufkhad [LNCS 2833 (2003), 108- 
122]. Their method is a bit more difficult to describe，but still easy to implement: 
Consider a complete binary tree that has n — 1 internal nodes numbered 1 through 
n — 1 ， and n leaves numbered n through 2 n — 1 ; the children of node for 
1 < k < are nodes 2k and 2&+1 (see 2.3.4.5-( 5 )). We form new variables bj for 
1 < k < n and 1 < j < t/c, where tk is the minimum of r and the number of leaves 
below node k. Then the following clauses, explained in exercise 27, do the job: 


+ 1 V&U for 0 <i<t 2 k, 0 <i<t 2 ^+i, l<k<n; ( 20 ) 

(b'i V b 3 j), for 0<i<t 2 , 0<jf <t 3 , i + + 1. ( 21 ) 

In these formulas we let 4 = 1 and b\ = Xk- n +i for n < A: < 2n; all literals 
and are to be omitted. Applying ( 20 ) and ( 21 ) to McGregor’s graph, with 
n = 110 and r = 7, yields just 1216 new clauses in 399 new variables. 

The same ideas apply when we want to ensure that : ri + ••• + $ n is af least r, 
because of the identity Sy r {x\^..., x n ) = 6 ^ n _ r ( 龙1， … ^x n ). And exercise 30 

considers the case of equality, when our goal is to make x\ H - hx n = r. We’ll 

discuss other encodings of such cardinality constraints below. 


Sinz 

Bailleux 

Boufkhad 

complete binary tree 
cardinality constraints 


^<r (^1 ， • • • ， ^n) 


symmetric threshold functions 
binary multiplication 
multiplication+ 


Factoring integers. Next on our agenda is a family of SAT instances with quite 
a different flavor. Given an (m + n)-bit binary integer z = ( 2 ： m + n . • • 之2之 1 ) 2， do 
there exist integers x = (x m ... ^ 1)2 and y = (y n • • • yi )2 such that z = x x y? 
For example, if m = 2 and n = 3, we want to invert the binary multiplication 


V3 V 2 yi 

X X 2 X 1 
^3 ai 

63 62 

C 3 C2 Cl 

Zh Za Z 2 2：1 


Zi = ai 

(a 3 a 2 ai) 2 = (^^2^1)2 x xi ^ Cl2：2 ^ 2 = a 2 + > 

(hhh ) 2 = immmh 乂奶 C2 々 2 = f + &2+Cl ( 22 ) 

(c 3 之 4)2 =03+02 

^5 = C 3 


when the 2： bits are given. This problem is satisfiable when z = 21 = (10101)2, 
in the sense that suitable binary values : r 1 ， X 2 , yi, 奶， 2 / 3 ，勿，奶，叱， fh, b‘ 2 ， 63 ， ci ， 
C2, C3 do satisfy these equations. But it’s unsatisfiable when z = 19 = (10011)2. 
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Arithmetical calculations like (22) are easily expressed in terms of clauses 
that can be fed to a SAT solver: We first specify the computation by constructing 
a Boolean chain, then we encode each step of the chain in terms of a few clauses. 
One such chain, if we identify a\ with zi and C3 with 2:5, is 

zi ^xiAyi, bi ^2 Ayi, 2 ： 2^a206i, s<-a 3 ㊉ b‘2, Z 3 <—sQci, 2:4^63©^, 

a 2 <—xiAy 2 , b 2 <—X 2 Ay 2 , c 十 ( 22 八 h, p—<23 八 62 ， q^-sAci, 之 5“3 八 <^2 ， 

a 3 Ays, 6 3 ^^ 2 Ay 3 , c 2 ^pVg, (23) 

using a “full adder” to compute C2Zs and “half adders” to compute c\ Z2 and C3 2:4 

(see 7 . 1 . 2 —(23) and (24)). And that chain is equivalent to the 49 clauses 

(^iV^)A(yiVzi)A(^iVyiVz 1 )A---A( 63 Vc 2 Vz 4 )A( 63 Vz 5 )A(c 2 Vz 5 )A( 6 3 Vc 2 Vz 5 ) 

obtained by expanding the elementary computations according to simple rules: 
t ^ u Av becomes (u W F) A (v W t) A (u W v \/ t)] 

t u\/ v becomes (u W t) A (v W t) A (u W v \/ t)] (24) 

t u®v becomes (uWvWt) A (uWvWt) A (u\/v\/t)A(u\/v\/t). 

To complete the specification of this factoring problem when ， say, 2 ： = (10101)2, 
we simply append the unary clauses (Z 5 ) 八 ( 乏 4) 八 ( 勿 ) 八 ( 乏 2) 八 (^i). 

Logicians have known for a long time that computational steps can readily 
be expressed as conjunctions of clauses. Rules such as (24) are now called Tseytin 
encoding, after Gregory Tseytin ( 1966 ). Our representation of a small five-bit 
factorization problem in 49+5 clauses may not seem very efficient; but we will see 
shortly that m-bit by n-bit factorization corresponds to a satisfiability problem 
with fewer than 6mn variables, and fewer than 20 mn clauses of length 3 or less. 

Even if the system has hundreds or thousands of formulas, 
it can be put into conjunctive normal form “piece by piece,” 

without any “multiplying out ■” 

— MARTIN DAVIS and HILARY PUTNAM (1958) 

Suppose m <n. The easiest way to set up Boolean chains for multiplication 
is probably to use a scheme that goes back to John Napier’s Rabdologise (Edin¬ 
burgh, 1617)，pages 137— 143， as modernized by Luigi Dadda [Alta Frequenza 
34 (1964) ， 349-356] : First we form all mn products Xi A yj, putting every such 
bit into bin[i + j], which is one of m + n “bins” that hold bits to be added 
for a particular power of 2 in the binary number system. The bins will contain 
respectively ( 0 , 1 ， 2 ,… ， m ， m, … ， m ， … ， 2 , 1 ) bits at this point, with n —m +1 
occurrences of “m” in the middle. Now we look at bin [k] for A: = 2, 3，…… If 
bin [k] contains a single bit 6 ， we simply set Zk-i ^ b. If it contains two bits 
{ 6 , 6 ’}，we use a half adder to compute Zk-i 1 6 ㊉ 6 ’， c 1 6 八 6 ’， and we put the 
carry bit c into bin [^ + 1]. Otherwise bin [k] contains t > 3 bits; we choose any 
three of them，say {b ， b’, 6 "}，and remove them from the bin. With a full adder we 
then compute r ^ 6 ㊉ 以 ㊉ 心 " and c ^ (Wb"), so that b + b’ + b" = r+ 2 c; and we 
put r into bin [k] , cinto bin [A:+l]. This decreases t by 2, so eventually we will have 
computed Zk-i. Exercise 41 quantifies the exact amount of calculation involved. 


Boolean chain 

full adder 

half adders 

unary clauses 

Tseytin encoding 

conjunctive normal form 

DAVIS 

PUTNAM 


Napier 

Dadda 

binary number system 
half adder 
carry bit 
full adder 


notation (tt6c) 


median operation 
ternary operations 
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This method of encoding multiplication into clauses is quite flexible，since 
we’re allowed to choose any three bits from bin [k] whenever four or more bits are 
present. We could use a first-in - first-out strategy, always selecting bits from the 
“rear” and placing their sum at the “front ”； or we could work last-in - first-out, 
essentially treating bin[k] as a stack instead of a queue. We could also select 
the bits randomly，to see if this makes our SAT solver any happier. Later in this 
section we’ll refer to the clauses that represent the factoring problem by calling 
them factor^fifo(m^ n, z\ factor or factor 一 rand (m ， n ， z ， s) ， respec¬ 
tively, where 5 is a seed for the random number generator used to generate them. 

It’s somewhat mind-boggling to realize that numbers can be factored without 
using any number theory! No greatest common divisors，no applications of 
Fermat’s theorems ， etc” are anywhere in sight. We’re providing no hints to 
the solver except for a bunch of Boolean formulas that operate almost blindly 
at the bit level. Yet factors are found. 

Of course we can’t expect this method to compete with the sophisticated 
factorization algorithms of Section 4.5.4. But the problem of factoring does dem¬ 
onstrate the great versatility of clauses. And its clauses can be combined with 
other constraints that go well beyond any of the problems weVe studied before. 


Fault testing. Lots of things can go wrong when computer chips are manufac¬ 
tured in the “real world,” so engineers have long been interested in constructing 
test patterns to check the validity of a particular circuit. For example，suppose 
that all but one of the logical elements are functioning properly in some chip; the 
bad one ， however, is stuck: Its output is constant，always the same regardless of 
the inputs that it is given. Such a failure is called a single-stuck-at fault 


2/3 2/2 Vl X2 Xl 



Fig. 34. A circuit that 
corresponds to ( 23 ). 


Figure 34 illustrates a typical digital circuit in 
detail: It implements the 15 Boolean operations 
of ( 23 ) as a network that produces five output sig¬ 
nals 2 : 52 : 42 : 32 : 22:1 from the five inputs ^ 3 ^ 22 / 1 ^ 2 ^ 1 - 
In addition to having 15 AND ， OR ， and XOR gates ， 
each of which transforms two inputs into one out¬ 
put, it has 15 “fanout” gates (indicated by dots 
at junction points), each of which splits one input 
into two outputs. As a result it comprises 50 
potentially distinct logical signals，one for each 
internal “wire •” Exercise 47 shows that a circuit 
with m outputs, n inputs, and g conventional 2 - 
to -1 gates will have g + m — n fanout gates and 
3g + 2m — n wires. A circuit with w wires has 2w 
possible single-stuck-at faults，namely w faults in 
which the signal on a wire is stuck at 0 and w 
more on which it is stuck at 1 . 

Table 1 shows 101 scenarios that are possible 
when the 50 wires of Fig. 34 are activated by one 
particular sequence of inputs, assuming that at 


first-in-first-out 

FIFO: first in first out 

last-in-first-out 

stack 

queue 

factor 一 fifo (m ， n ， z) 
factorJifo (m ， n ， z) 
factor—rand (m ， n ， 2 ：， s) 

Fermat 
Fault testing 

ATPG ： Automatic test pattern generation, see F 

test patterns 

circuit 

circuit ： see also Boolean chain 

single-stuck-at fault 
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Table 1 


SINGLE - STUCK-AT FAULTS IN FIGURE 34 WHEN x 2 xi = 11, 2 / 32 / 22/1 = 110 


default 

bitwise 


0'Kx 1 x\x\x\x\xQ i x\x^x^x^y 1 y\y^yQ i y\y^y^y\y^z 1 a^a\a^a^a\a^b 1 b\ b\ b\ b\ b 3 63 z 2 ci c\ c\ s s 1 s 2 p z 3 q c^c\c^z^z^ 



—yi 

<~Vl 


V 2 ^V 2 

Vo<-V2 


y 3 "3 


zi 

^2 
1 
a. 

a 

^3 
1 
a . 

a 

bi 


2 

2 

2 


3 

2 

3 


^ 1 ^^2 
^2 

^ 2 ^Vi 




<~ b l 

62 ^ x 2 
^2 —^2 

b 3 <-x^Ay^ 

b 3^~ b 3 

z^^r-a\®b\ 
c 2 ^~~ct^ 
c\^c x 
cf—Ci 

S A —㊉ 


5 2 <—5 

p ^ — <1., A 
^3<-5 1 ©cJ 
<?<-5 2 AC^ 
C2<—p^q 

4—c 2 

c_—c 2 

之 4<-6& ㊉ eg 

Z 5 — 


10111111111111111111111111111111111111111111111111111111111111111111111111111111111111111111111111111 
1 01O1111111111111111111111111111111111111111111111111111111111111111111111111111111111111111111111111 
1 0111011111111111111111111111111111111111111111111111111111111111111111111111111111111111111111111111 
1 0101110111111111111111111111111111111111111111111111111111111111111111111111111111111111111111111111 
1 0101111101111111111111111111111111111111111111111111111111111111111111111111111111111111111111111111 
1 1111111111011111111111111111111111111111111111111111111111111111111111111111111111111111111111111111 
1 1111111111010111111111111111111111111111111111111111111111111111111111111111111111111111111111111111 
1 1111111111011101111111111111111111111111111111111111111111111111111111111111111111111111111111111111 
1 1111111111010111011111111111111111111111111111111111111111111111111111111111111111111111111111111111 
1 1111111111010111110111111111111111111111111111111111111111111111111111111111111111111111111111111111 
00000000000000000000001000000000000000000000000000000000000000000000000000000000000000000000000000000 
00000000000000000000001010000000000000000000000000000000000000000000000000000000000000000000000000000 
0 0000000000000000000001000100000000000000000000000000000000000000000000000000000000000000000000000000 
1 1111111111111111111111111101111111111111111111111111111111111111111111111111111111111111111111111111 
1 1111111111111111111111111101011111111111111111111111111111111111111111111111111111111111111111111111 
1 1111111111111111111111111101110111111111111111111111111111111111111111111111111111111111111111111111 
1 1111111111111111111111111111111101111111111111111111111111111111111111111111111111111111111111111111 
1 1111111111111111111111111111111101011111111111111111111111111111111111111111111111111111111111111111 
1 1111111111111111111111111111111101110111111111111111111111111111111111111111111111111111111111111111 
0 0000000000000000000001010000000000000001000000000000000000000000000000000000000000000000000000000000 
1 0101110111111111111111111101011111111111011111111111111111111111111111111111111111111111111111111111 
1 0101110111111111111111111101011111111111010X11111111111111111111111111111111111111111111111111111111 
1 0101110111111111111111111101011111111111011101111111111111111111111111111111111111111111111111111111 
1 0101111101111111111111111111111101011111111111011111111111111111111111111111111111111111111111111111 
1 0101111101111111111111111111111101011111111111010111111111111111111111111111111111111111111111111111 
1 0101111101111111111111111111111101011111111111011101111111111111111111111111111111111111111111111111 
0 0000000000000000000001000100000000000000000000000000010000000000000000000000000000000000000000000000 
00000000000000000000001000100000000000000000000000000010100000000000000000000000000000000000000000000 
0 0000000000000000000001000100000000000000000000000000010001000000000000000000000000000000000000000000 
1 1111111111010111011111111101110111111111111111111111111111011111111111111111111111111111111111111111 
1 1111111111010111011111111101110111111111111111111111111111010111111111111111111111111111111111111111 
1 1111111111010111011111111101110111111111111111111111111111011101111111111111111111111111111111111111 
1 1111111111010111110111111111111101110111111111111111111111111111011111111111111111111111111111111111 
1 1111111111010111110111111111111101110111111111111111111111111111010111111111111111111111111111111111 
1 1111111111010111110111111111111101110111111111111111111111111111011101111111111111111111111111111111 
1 0101110111111111111110111001011111111111010111111111101011111111111111011111111111111111111111111111 
00000000000000000000001000100000000000000000000000000010001000000000000000100000000000000000000000000 
00000000000000000000001000100000000000000000000000000010001000000000000000101000000000000000000000000 
00000000000000000000001000100000000000000000000000000010001000000000000000100010000000000000000000000 
0 1010000010101000100000000010001010100000000000101000000000101000000000000000000100000000000000000000 
0 1010000010101000100000000010001010100000000000101000000000101000000000000000000101000000000000000000 
0 1010000010101000100000000010001010100000000000101000000000101000000000000000000100010000000000000000 
1 0101111101010111011111111101110101011111111111011101111111011101111111111111111111110111111111111111 
0 1010000010101000100001000110001010100000000000101000010001101000000000000101000101000001000000000000 
0 0000000000000000000000000000000000000000000000000000000000000000000000000000000000000000010000000000 
1 0101111101010111011111111101110101011111111111011101111111011101111111111111111111110111110111111111 
1 0101111101010111011111111101110101011111111111011101111111011101111111111111111111110111110101111111 
1 0101111101010111011111111101110101011111111111011101111111011101111111111111111111110111110111011111 
0 1010000010000000101000000010001000101000000000100010000000100010101000000000000000001000001010000100 
1 0101111101010111010111111101110101010111111111011101111111011101011101111111111111110111110111011101 


most one stuck - at fault is present. The column headed OK shows the correct 
behavior of the Boolean chain (which nicely multiplies x = 3 by y = 6 and 
obtains 2 ： = 18). We can call these the “default” values, because, well, they have 
no faults. The other 100 columns show what happens if all but one of the 50 
wires have error-free signals; the two columns under b\^ for example，illustrate 
the results when the rightmost wire that fans out from gate 62 is stuck at 0 
or 1. Each row is obtained bitwise from previous rows or inputs, except that the 
boldface digits are forced. When a boldface value agrees with the default, its 
entire column is correct; otherwise errors might propagate. All values above the 
bold diagonal match the defaults. 

If we want to test a chip that has n inputs and m outputs, we’re allowed 
to apply test patterns to the inputs and see what outputs are produced. Close 
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inspection shows, for instance, that the pattern considered in Table 1 doesn’t 
detect an error when q is stuck at 1 , even though q should be 0 , because all five 
output bits Z 5 Z 4 ZSZ 2 Z 1 are correct in spite of that error. In fact, the value of 
ci ^ p\/ q is unaffected by a bad g，because p = 1 in this example. Similarly, 
the fault u x\ stuck at 0’’ doesn’t propagate into zi < - x\ A y\ because y\ = 0. 
Altogether 44 faults，not 50, are discovered by this particular test pattern. 

All of the relevant repeatable faults，whether they’re single-stuck-at or wildly 
complicated，could obviously be discovered by testing all 2 n possible patterns. 
But that’s out of the question unless n is quite small. Fortunately，testing isn’t 
hopeless，because satisfactory results are usually obtained in practice if we do 
have enough test patterns to detect all of the detectable single-stuck-at faults. 
Exercise 49 shows that just five patterns suffice to certify Fig. 34 by this criterion. 

The detailed analysis in exercise 49 also shows ， surprisingly，that one of the 
faults，namely u s 2 stuck at 1，” cannot be detected! Indeed, an erroneous s 2 can 
propagate to an erroneous q only if c\ = 1 ， and that forces x\ — X 2 = yi = 奶 =1; 
only two possibilities remain，and neither y 3 = 0 nor y 3 = 1 reveals the fault. 
Consequently we can simplify the circuit by removing gate g; the chain ( 23 ) 
becomes shorter, with “g s 八 ci ， C 2 l p V g” replaced by “C 2 4- p V ci.” 

Of course Fig. 34 is just a tiny little circuit, intended only to introduce the 
concept of stuck-at faults. Test patterns are needed for the much larger circuits 
that arise in real computers; and we will see that SAT solvers can help us to find 
them. Consider，for example，the generic multiplier circuit prod(m,n), which is 
part of the Stanford GraphBase. It multiplies an m-bit number x by an n-bit 
number y ， producing an (m + n) - bit product 2 ：. Furthermore, it’s a so-called 
“parallel multiplier，” with delay time 0 (log(m + n)); thus it’s much more suited 
to hardware design than methods like the factor—fifo schemes that we considered 
above，because those circuits need + n) time for carries to propagate. 

Let’s try to find test patterns that will smoke out all of the single-stuck-at 
faults in prod(32, 32)，which is a circuit of depth 33 that has 64 inputs，64 out - 
puts, 3660 AND gates, 1203 OR gates, 2145 XOR gates, and (therefore) 7008 fan- 
out gates and 21,088 wires. How can we guard it against 42,176 different faults? 

Before we construct clauses to facilitate that task, we should realize that 
most of the single-stuck-at faults are easily detected by choosing patterns at 
random，since faults usually cause big trouble and are hard to miss. Indeed ， 
choosing x = # 3243F6A8 and y = # 885A308D more or less at random already 
eliminates 14,733 cases; and (x,y) = ( # 2B7E1516, # 28AED2A6) eliminates 6,918 
more. We might as well keep doing this，because bitwise operations such as those 
in Table 1 are fast. Experience with the smaller multiplier in Fig. 34 suggests 
that we get more effective tests if we bias the inputs, choosing each bit to be 1 
with probability .9 instead of .5 (see exercise 49). A million such random inputs 
will then generate, say，243 patterns that detect all but 140 of the faults. 

Our remaining job ， then, is essentially to find 140 needles in a haystack of 
size 2 64 , after having picked 42,176 — 140 = 42,036 pieces of low-hanging fruit. 
And that’s where a SAT solver is useful. Consider, for example, the analogous 
but simpler problem of finding a test pattern for “g stuck at 0” in Fig. 34. 
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We can use the 49 clauses F derived from ( 23 ) to represent the well-behaved 
circuit; and we can imagine corresponding clauses F ! that represent the faulty 
computation, using “primed” variables z[^ q! 2 , ••• ， z’ 5 . Thus F 1 begins with 
(xi \/ z[) A(yiW z[) and ends with (63 V^); it’s like F except that the clauses 

representing q 1 4- s 7 A c[ in ( 23 ) are changed to simply q 1 (meaning that q 1 is 
stuck at 0). Then the clauses of F and together with a few more clauses to 
state that 2:1 / or • • • or 之 5 / 2 ^， will be satisfiable only by variables for which 
(^ 32 / 2^1 )2 x (^ 2 ^ 1)2 is a suitable test pattern for the given fault. 

This construction of F 1 can obviously be simplified, because z[ is identical 
to zi] any signal that differs from the correct value must be located “downstream” 
from the one-and-only fault. Let’s say that a wire is tarnished if it is the faulty 
wire or if at least one of its input wires is tarnished. We introduce new variables 
g 1 only for wires g that are tarnished. Thus，in our example, the only clauses F 1 
that are needed to extend F to a faulty companion circuit are q 1 and the clauses 
that correspond to d 2 ^ pW q 1 ^ z f 4 <- 63 ㊉ z f 5 63 A C 2 . 

Moreover 5 any fault that is revealed by a test pattern must have an active 
path of wires, leading from the fault to an output; all wires on this path must 
carry a faulty signal. Therefore Tracy Larrabee [IEEE Trans. CAD-11 (1992), 
4-15] decided to introduce additional “sharped” variables for each tarnished 
wire, meaning that g lies on the active path. The two clauses 

V^V^) A (# V 々 V 々’） ( 25 ) 

ensure that g ^ g f whenever g is part of that path. Furthermore we have V g^) 
whenever g is an AND, OR, or XOR gate with tarnished input v. Fanout gates 
are slightly tricky in this regard: When wires g 1 and g 2 fan out from a tarnished 
wire 仏 we need variables g 1 ^ and g 2 ^ as well as and we introduce the clause 

V g n V g n ) ( 26 ) 


« 4 


to specify that the active path takes at least one of the two branches. 
According to these rules，our example acquires the new variables 
z\^ and the new clauses 

(^VgVg 7 ) A (^VgVg ; ) A (^ Vc|) A(4 Vc 2 v4) A(4 Vc 2 v4) A(4vc^ Vc^) A 

(皆 V4) A (4vz 4 v 4) A(4v&Vg) A (逆 Vg) A (輔 Vq Vg) A 乏 5 V4). 


The active path begins at g，so we assert the unit clause (g^); it ends at a 
tarnished output, so we also assert V z\). The resulting set of clauses will 
find a test pattern for this fault if and only if the fault is detectable. Larrabee 
found that such active-path variables provide important clues to a SAT solver 
and significantly speed up the solution process. 

Returning to the large circuit prod (32, 32)，one of the 140 hard-to-test faults 
is u W 2 i stuck at 1 ,” where denotes the 26th extra wire that fans out from 
the OR gate called W 21 in §75 of the Stanford GraphBase program GB_GATES; 
W 21 is an input to gate 640 A W 21 in §80 of that program. Test patterns 

for that fault can be characterized by a set of 23,194 clauses in 7,082 variables 
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(of which only 4 variables are “primed” and 4 are “sharped”）. Fortunately 
the solution (x ， y) = ( # 7F13FEDD 5 # 5FE57FFE) was found rather quickly in the 
author’s experiments; and this pattern also killed off 13 of the other cases, so 
the score was now “14 down and 126 to go ”！ 

The next fault sought was 置 6 , 2 stuck at 1，” where 篇 6,2 is the second 
extra wire to fan out from the AND gate 乂慧 6 in §72 of GB.GATES (an input 
to Rff •<— Ag 6,2 A jR^ 5,2 ). This fault corresponds to 26,131 clauses on 8,342 
variables; but the SAT solver took a quick look at those clauses and decided 
almost instantly that they are unsatisfiable. Therefore the fault is undetectable, 
and the circuit prod(32^ 32) can be simplified by setting Rff ^r- 辦 5 ’ 2 . A closer 
look showed, in fact, that clauses corresponding to the Boolean equations 

x = y A y = v /\w, z — t f\u^ ^ 二 ^㊉ 忉 

were present (where t = u — v — R\ A ^ w = A\\^ x = y = 

2 ： = these clauses force x = 0. Therefore it was not surprising to find 

that the list of unresolved faults also included R^ 1 and R^ 2 stuck at 0 . 
Altogether 26 of the 140 faults undetected by random inputs turned out to be 
absolutely undetectable; and only one of these, namely “Q 翡 stuck at 0 ，” required 
a nontrivial proof of undetectability. 

Some of the 126 — 26 = 100 faults remaining on the to-do list turned out to be 
significant challenges for the SAT solver. While waiting，the author therefore had 
time to take a look at a few of the previously found solutions，and noticed that 
those patterns themselves were forming a pattern! Sure enough, the extreme por¬ 
tions of this large and complicated circuit actually have a fairly simple structure ， 
stuck-at-fault-wise. Hence number theory came to the rescue: The factorization 
# 87FBC059 x # F0F87817 = 2 63 — 1 solved many of the toughest challenges ， 
some of which occur with probability less than 2 -34 when 32-bit numbers are 
multiplied; and the “Aurifeuillian” factorization (2 31 — 2 16 + 1) (2 31 +2 16 + 1)= 
2 62 + 1， which the author had known for more than forty years (see Eq. 4.5.4- 
( 15 ))，polished off most of the others. 

The bottom line (see exercise 51) is that all 42,150 of the detectable single - 
stuck-at faults of the parallel multiplication circuit prod (32^ 32) can actually be 
detected with at most 196 well-chosen test patterns. 

Learning a Boolean function. Sometimes we’re given a “black box” that 
evaluates a Boolean function f(x\^ ... ,xn). We have no way to open the box, 
but we suspect that the function is actually quite simple. By plugging in various 
values for $ = rci …: ttv ，we can observe the box’s behavior and possibly learn the 
hidden rule that lies inside. For example，a secret function of TV = 20 Boolean 
variables might take on the values shown in Table 2, which lists 16 cases where 
f(x) = 1 and 16 cases where f(x) = 0 . 

Suppose we assume that the function has a DNF (disjunctive normal form) 
with only a few terms. We’ll see in a moment that it’s easy to express such an 
assumption as a satisfiability problem. And when the author constructed clauses 
corresponding to Table 2 and presented them to a SAT solver，he did in fact learn 
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Table 2 

VALUES TAKEN ON BY AN UNKNOWN FUNCTION 
Cases where f(x) = 1 Cases where f(x) = 0 


XiX2X3X4X^XQX t j XgXQ ••• X20 

11001001000011111101 
10101010001000100001 
01101000110000100011 
01001100010011000110 
01100010100010111000 
00001101110000011100 
1 1010001001010010000 
00100100111000001000 
10001010011001111100 
1 1 0001110100000000 1 0 
00001011101111101010 
01100011101100010011 
1 001 1 011001000 1 00101 
00010100101000001000 
01111001100011100011 
01000000010011011101 


XiX2XQX4X^XQX / j XgXQ ••• X20 

10101101111110000101 
01000101100010100010 
10111011010010101001 
10101010111111011100 
01010110001000000010 
01110011110100111100 
1 1 1 10001110110001011 
10011100010110000011 
11001110001011010011 
01101001010110101001 
11100001001101100100 
00010001010001100100 
0011001111 1 1101 1 1100 
11001001001110011101 
11001110001001001001 
10 1 100111110111 1 1001 


almost immediately that a very simple formula is consistent with all of the data: 

/(Ti ， … ，冗 20 ) = _3 无 10 V X 6 Xi 0 Xr 2 V x 8 x 13 x 15 V X 8 X 10 X 12 . ( 27 ) 

This formula was discovered by constructing clauses in 2MN variables pij 
and q、j for 1 < i < M and 1 < j < where M is the maximum number of 
terms allowed in the DNF (here M = 4) and where 

Pij = [term i contains Xj]^ q、j = [term i contains Xj]. ( 28 ) 

If the function is constrained to equal 1 at P specified points，we also use auxiliary 
variables z、k for 1 < i < M and 1 < k < P ，one for each term at every such point. 

Table 2 says that /(I ， 1,0, () ， ••• ， 1) = 1， and we can capture this specification 
by constructing the clause 

(zi,i V ^2,1 v ... V z M ,i) (29) 

together with the clauses 

A ( 右 , iV 条 , 2 ) 八 (z iA Wpi i3 ) A (z iA Wpi A ) 八 … 八 ( 右 , lV 豕 , 20 ) (3。) 

for 1 < i < M. Translation: ( 29 ) says that at least one of the terms in the DNF 
must evaluate to true; and ( 30 ) says that，if term i is true at the point 1100 … 1 ， 
it cannot contain or 无 2 or T 3 or $4 or • • • or ^ 20 - 

Table 2 also tells us that /(1 ， 0,1 ， 0,… ， 1) = 0. This specification corre¬ 
sponds to the clauses 

(^,1 v Pi ^2 v V p iA V ... V q i ： 2 o) (3 1 ) 

for 1 < i < M. (Each term of the DNF must be zero at the given point; thus 
either x\ or X 2 or xs or $4 or • • • or X 20 must be present for each value of i.) 

In general，every case where f(x) = 1 yields one clause like ( 29 ) of length M ， 
plus MN clauses like ( 30 ) of length 2. Every case where f(x) = 0 yields M 
clauses like ( 31 ) of length N. We use qij when Xj = 1 at the point in question ， 
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and pij when Xj =0， for both ( 30 ) and ( 31 ). This construction is due to 
A. P. Kamath, N. K. Karmarkar, K. G. Ramakrishnan，and M. G. C. Resende 
[Mathematical Programming 57 (1992) ， 215-238], who presented many exam¬ 
ples. From Table 2, with M = 4, iV = 20, and P = 16, it generates 1360 clauses 
of total length 3904 in 224 variables; a SAT solver then finds a solution with 
Pi,i = qi,i = Pi ,2 = o, gi ,2 = 1, , leading to ( 27 ). 

The simplicity of ( 27 ) makes it plausible that the SAT solver has indeed 
psyched out the true nature of the hidden function f(x). The chance of agreeing 
with the correct value 32 times out of 32 is only 1 in 2 32 , so we seem to have 
overwhelming evidence in favor of that equation. 

But no: Such reasoning is fallacious. The numbers in Table 2 actually arose 
in a completely different way, and Eq. ( 27 ) has essentially no credibility as a 
predictor of f(x) for any other values of x\ (See exercise 53.) The fallacy comes 
from the fact that short-DNF Boolean functions of 20 variables are not at all 
rare; there are many more than 2 32 of them. 

On the other hand, when we do know that the hidden function f(x) has 
a DNF with at most M terms (although we know nothing else about it)，the 
clauses ( 29 )—( 31 ) give us a nice way to discover those terms, provided that we 
also have a sufficiently large and unbiased “training set” of observed values. 

For example ， let’s assume that ( 27 ) actually is the function in the box. If 
we examine f(x) at 32 random points x, we don’t have enough data to make 
any deductions. But 100 random training points will almost always home in on 
the correct solution ( 27 ). This calculation typically involves 3942 clauses in 344 
variables; yet it goes quickly，needing only about 100 million accesses to memory. 

One of the author’s experiments with a 100-element training set yielded 

A 

f(Xu … ， $ 20 ) = ^2^3^10 V ^ 3 X 6 ^ 10^12 V ^8^13^15 V _ 10 $ 12 , (3 2 ) 

A 

which is close to the truth but not quite exact. (Exercise 59 proves that f(x) 
is equal to f(x) more than 97% of the time.) Further study of this example 
showed that another nine training points were enough to deduce f(x) uniquely, 
thus obtaining 100 % confidence (see exercise 61). 

Bounded model checking. Some of the most important applications of SAT 
solvers in practice are related to the verification of hardware or software, because 
designers generally want some kind of assurance that particular implementations 
correctly meet their specifications. 

A typical design can usually be modeled as a transition relation between 
Boolean vectors X = x\ .. .x n that represent the possible states of a system. We 
write X X f if state X at time t can be followed by state X f at time t + 1. 
The task in general is to study sequences of state transitions 

為 ^2 -> ••- X r , ( 33 ) 

and to decide whether or not there are sequences that have special properties. 
For example, we hope that there’s no such sequence for which Xq is an “initial 
state” and X r is an “error state ”； otherwise there^d be a bug in the design. 
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Fig. 35. Conway’s rule ( 35 ) defines these three successive transitions. 


Questions like this are readily expressed as satisfiability problems: Each 
state X t is a vector of Boolean variables x t \ ... x tn , and each transition relation 
can be represented by a set of m clauses T{X t ^X t+ i) that must be satisfied. 
These clauses T(X, X 1 ) involve 2n variables {x x ^... ^x n ^x , 1 ^... together 
with q auxiliary variables 切 1 ，…， ％} that might be needed to express Boolean 
formulas in clause form as we did with the Tseytin encodings in ( 24 ). Then the 
existence of sequence ( 33 ) is equivalent to the satisfiability of mr clauses 

T(X 0 ,X 1 ) A T(X u X 2 ) A ... A T(U) (34) 

in the n (r + 1 ) + gr variables {x t j | 0 < t < r, l<j <n}\J{y t k \ 0 <t<r 5 1 < A:<g}. 
We’ve essentially “unrolled” the sequence ( 33 ) into r copies of the transition 
relation, using variables x t j for state X t and y t k for the auxiliary quantities 
in T{X t ^X t+ i). Additional clauses can now be added to specify constraints on 
the initial state Xq and/or the final state X r ，as well as any other conditions 
that we want to impose on the sequence. 

This general setup is called “bounded model checking，” because we’re using 
it to check properties of a model (a transition relation)，and because we’re 
considering only sequences that have a bounded number of transitions, r. 

John Conway’s fascinating Game of Life provides a particularly instructive 
set of examples that illustrate basic principles of bounded model checking. The 
states X of this game are two-dimensional bitmaps，corresponding to arrays of 
square cells that are either alive (1) or dead (0). Every bitmap X has a unique 
successor X 1 , determined by the action of a simple 3x3 cellular automaton: 
Suppose cell x has the eight neighbors {$n W ,$n ， $ NE ， t w ， t e ， t sw ，$ s ，$ SE }，and 
let v — a：Nw + + ^ne + +^e +$sw +$s +^ S e be the number of neighbors that 
are alive at time t. Then x is alive at time t + 1 if and only if either (a) z/ = 3, 
or (b) z/ = 2 and x is alive at time t. Equivalently, the transition rule 

x = 2 < $NW + + $NE + $SW + + $SE < 4. ( 35 ) 

holds at every cell x. (See，for example, Fig. 35, where the live cells are black.) 

Conway called Life a “no - player game，” because it involves no strategy: 
Once an initial state Xq has been set up, all subsequent states Xi, X 2 ， •… are 
completely determined. Yet, in spite of the simple rules, he also proved that Life 
is inherently complicated and unpredictable, indeed beyond human comprehen¬ 
sion, in the sense that it is universal: Every finite^ discrete，deterministic system ， 
however complex，can be simulated faithfully by some finite initial state Xq 
of Life. [See Berlekamp ， Conway，and Guy, Winning Ways (2004)，Chapter 25.] 

In exercises 7.1.4-160 through 162, we’ve already seen some of the amazing 
Life histories that are possible, using BDD methods. And many further aspects 
of Life can be explored with SAT methods, because SAT solvers can often deal 
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with many more variables. For example ， Fig. 35 was discovered by using 7 x 15 = 
105 variables for each state X 2 , X 3 . The values of X 3 were obviously 

predetermined; but the other 105 x 3 = 315 variables had to be computed, and 
BDDs can’t handle that many. Moreover, additional variables were introduced 
to ensure that the initial state X 0 would have as few live cells as possible. 

Here’s the story behind Fig. 35, in more detail: Since Life is two-dimensional ， 
we use variables instead of Xj to indicate the states of individual cells, and x t ij 
instead of x t j to indicate the states of cells at time t. We generally assume that 
x t ij = 0 for all cells outside of a given finite region，although the transition rule 
(35) can allow cells that are arbitrarily far away to become alive as Life goes on. 
In Fig. 35 the region was specified to be a 7 x 15 rectangle at each unit of time. 
Furthermore, configurations with three consecutive live cells on a boundary edge 
were forbidden, so that cells “outside the box” wouldn’t be activated. 

The transitions T(X t ^ X t+ i) can be encoded without introducing additional 
variables, but only if we introduce 190 rather long clauses for each cell not on the 
boundary. There’s a better way, based on the binary tree approach underlying 
(20) and (2i) above，which requires only about 63 clauses of size $ 3, together 
with about 14 auxiliary variables per cell. This approach (see exercise 65) takes 
advantage of the fact that many intermediate calculations can be shared. For 
example，cells x and x w have four neighbors {tn W ， Tn ， t sw ， :t s } in common; so 
we need to compute x NW + + x sw + x s only once，not twice. 

The clauses that correspond to a four-step sequence X 0 ^ X 2 -> 

X 3 —)• X 4 leading to X 4 = LIFE turn out to be unsatisfiable without going 
outside of the 7 x 15 frame. (Only 10 gigamems of calculation were needed to 
establish this fact，using Algorithm C below, even though roughly 34000 clauses 
in 9000 variables needed to be examined!) So the next step in the preparation 
of Fig. 35 was to try X 3 = LIFE; and this trial succeeded. Additional clauses, 
which permitted X 0 to have at most 39 live cells，led to the solution shown, at a 
cost of about 17 gigamems; and that solution is optimum, because a further run 
(costing 12 gigamems) proved that there’s no solution with at most 38. 

Let’s look for a moment at some of the patterns that can occur on a 
chessboard, an 8 x 8 grid. Human beings will never be able to contemplate more 
than a tiny fraction of the 2 64 states that are possible; so we can be fairly sure 
that “Lifenthusiasts” haven’t already explored every tantalizing configuration 
that exists，even on such a small playing field. 

One nice way to look for a sequence of interesting Life transitions is to assert 
that no cell stays alive more than four steps in a row. Let us therefore say that 
a mobile Life path is a sequence of transitions X 0 —> Xi ^ X r with the 

additional property that we have 

(xtij v X (w)ij V x {W )ij V V ~+4)ij), for 0 < t < r - 4. (36) 

To avoid trivial solutions we also insist that X r is not entirely dead. For example ， 
if we impose rule (36) on a chessboard，with x t ij permitted to be alive only if 
I < i^j <8, and with the further condition that at most five cells are alive in each 
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generation, a SAT solver can quickly discover interesting mobile paths such as 

圍 4 圍 4 圍 4 圍 4 圍 4 圍 4 圍 4 圍 +.. ， (耵) 

which last quite awhile before leaving the board. And indeed，the five - celled 
object that moves so gracefully in this path is R. K. Guy’s famous glider (1970 )， 
which is surely the most interesting small creature in Life’s universe. The glider 
moves diagonally, recreating a shifted copy of itself after every four steps. 

Interesting mobile paths appear also if we restrict the population at each 
time to {6, 7,8, 9, 10} instead of {1 ， 2,3,4, 5}. For example，here are some of the 
first such paths that the author’s solver came up with, having length r = 8: 



These paths illustrate the fact that symmetry can be gained, but never lost, as 
Life evolves deterministically. Marvelous designs are spawned in the process. 
In each of these sequences the next bitmap, X 9 , would break our ground rules: 
The population immediately after X 8 grows to 12 in the first and last examples, 
but shrinks to 5 in the second - from-last; and the path becomes immobile in the 
other two. Indeed，we have X 5 = X 7 in the second example，hence Xq = X 8 
and X 飞二 Xg, etc. Such a repeating pattern is called an oscillator of period 2. 
The third example ends with an oscillator of period 1， known as a “still life •” 

What are the ultimate destinations of these paths? The first one becomes 
still，with Xqq = X 70 ; and the fourth becomes very still，with X 12 = 0! The 
fifth is the most fascinating of the group, because it continues to produce ever 
more elaborate valentine shapes，then proceeds to dance and sparkle，until finally 
beginning to twinkle with period 2 starting at time 177. Thus its members X 2 
through X 7 qualify as “Methuselahs，” defined by Martin Gardner as “Life pat¬ 
terns of population less than 10 that do not become stable within 50 generations •” 
(A predictable pattern，like the glider or an oscillator，is called stable.) 

SAT solvers are basically useless for the study of Methuselahs, because the 
state space becomes too large. But they are quite helpful when we want to 
illuminate many other aspects of Life, and exercises 66—85 discuss some notable 
instances. We will consider one more instructive example before moving on ， 
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namely an application to “eaters •” Consider a Life path of the form 

I ! I 

T7T7T7 — — — — TT — T7T7T7T7T7 — — TT TT TT ~ 77 77 77 77 | | | — | 

4 1111 -> 111 ^ __ 4 !^1；-> —= 又5， （38) 

1 lii l?l?l?l?l?l?l?itl l?l?l?l?l?l?l?1?i 1?1?1?1?1?1?1?"H 111111 Mi 11 1111 li 

where the gray cells form a still life and the cells of Xi^ X 2 ^ X 3 are unknown. 

Thus X 4 = X 5 and X 0 = X 5 + glider. Furthermore we require that the still 
life X 5 does not interact with the glider’s parent, H; see exercise 77. The idea 
is that a glider will be gobbled up if it happens to glide into this particular still 
life, and the still life will rapidly reconstitute itself as if nothing had happened. 
Algorithm C almost instantaneously (well, after about 100 megamems) finds 



the four-step eater first observed in action by R. W. Gosper in 1971. 


Applications to mutual exclusion. Let’s look now at how bounded model 
checking can help us to prove that algorithms are correct. (Or incorrect.) Some 
of the most challenging issues of verification arise when we consider parallel 
processes that need to synchronize their concurrent behavior. To simplify our 
discussion it will be convenient to tell a little story about Alice and Bob. 

Alice and Bob are casual friends who share an apartment. One of their joint 
rooms is special: When they’re in that critical room，which has two doors, they 
don’t want the other person to be present. Furthermore, being busy people, they 
don’t want to interrupt each other needlessly. So they agree to control access to 
the room by using an indicator light, which can be switched on or off. 

The first protocol they tried can be characterized by symmetrical algorithms: 

A0. Maybe go to Al. B0. Maybe go to Bl. 

Al. If l go to Al, else to A2. Bl. If l go to Bl, else to B2. 

A2. Set Z i 1， go to A3. B2. Set Z 1 1， go to B3. ( 40 ) 

A3 - Critical, go to A4. B3. Critical, go to B4. 

A4. Set Z 1 0, go to A0. B4. Set Z —• 0, go to B0. 

At any instant of time，Alice is in one of five states, {A0, Al，A2, A3, A4}，and 
the rules of her program show how that state might change. In state A0 she isn’t 
interested in the critical room; but she goes to Al when she does wish to use it. 
She reaches that objective in state A3. Similar remarks apply to Bob. When 
the indicator light is on (1 = 1 )，they wait until the other person has exited the 
room and switched the light back off (1 = 0). 

Alice and Bob don’t necessarily operate at the same speed. But they’re 
allowed to dawdle only when in the “maybe” state A0 or B0. More precisely，we 
model the situation by converting every relevant scenario into a discrete sequence 
of state transitions. At every time t = 0^ 1， 2， either Alice or Bob (but not 
both) will perform the command associated with their current state，thereby per¬ 
haps changing to a different state at time t + 1. This choice is nondeterministic. 

Only four kinds of primitive commands are permitted in the procedures we 
shall study，all of which are illustrated in ( 40 ): (1) “Maybe go to s” ； (2) “Critical， 


eaters 

Gosper 

parallel processes 



Bob- 

“maybe” state 
nondeterministic 


September 23, 2015 





























7.2.2.2 


SATISFIABILITY: EXAMPLE APPLICATIONS 


21 


go to s ”； (3) “Set v <- b, go to s ”； and (4) u If v go to si，else to so”. Here s 
denotes a state name, v denotes a shared Boolean variable, and 6 is 0 or 1. 

Unfortunately, Alice and Bob soon learned that protocol ( 40 ) is unreliable: 
One day she went from A1 to A2 and he went from B1 to B2, before either of 
them had switched the indicator on. Embarrassment (A3 and B3) followed. 

They could have discovered this problem in advance, if they’d converted the 
state transitions of ( 40 ) into clauses for bounded model checking, as in ( 33 )，then 
applied a SAT solver. In this case the vector X t that corresponds to time t con¬ 
sists of Boolean variables that encode each of their current states，as well as the 
current value of 1. We can, for example, have eleven variables AOt, A2 t , A3t, 
A4 t ， BOt, B2 t ， B3t, B4 t ， 心 ， together with ten binary exclusion clauses (AOt V 
Al t ), (AOt V k2t), … ， (A3t V A4 t ) to ensure that Alice is in at most one state ， 
and with ten similar clauses for Bob. There’s also a variable ， which is true or 
false depending on whether Alice or Bob executes their program step at time t. 
(We say that Alice was “bumped” if = 1 ， and Bob was bumped if = 0-) 

If we start with the initial state Xq defined by unit clauses 

AOq A AIq A A 2 q A A 3 q A A 4 q A BOq A BIq A B 2 q A B 3 q A B 4o A 『0， （4 1 ) 


the following clauses for 0 < t < r (discussed in exercise 87) will emulate the 
first r steps of every legitimate scenario defined by ( 40 ): 


(@ t V AO^V 
(@ t V Ally 

(@ t v A2 lV 

(@ t V A37 v 
(@ t V A4^V 

(@^ v bo^v 

(gVBlIV 
(gVB 27 v 
(@ ； VB37v 
(@^v B4^V 


AO t+ i) 

Alt+i) 

A 2 t +i) 

A3f+i) 

A4 t+1 ) 

BO t+ i) 

Blt+i) 

B2 t+ i) 

B3 t+ i) 

B4 t+ i) 


(5 

(座 

(座 

(座 

(座 

(5 

(座 

(座 

(座 

(瓦 


V A07 

V AI7 
v AT7 
v A 27 
v A2 ； 

V A3I 

V A47 

V A47 

V 

Vlt V 


V AO t+ i VAl t+ i) 

V Al w ) 

V A2 t+ i) 

V A3 t+ i) 

V k+i) 

v A4 t+1 ) 

V AO t+ i) 

V _ 

A2 t VA4 t vTt+i) 
A2 t VA4 t V/ t+ i) 


(@ t V B0 t V BO 奸 1 V Bl t+1 ) 

(@ t vm 7 vr t vBi t+ i) 

(@ t vm7v/ t vB 2 t+1 ) 

(@ t VB27VB3 t+ i) 

(@ t VB 2 ^V^ + i) 

(@ t VB3lVB4 t+1 ) 

(@ t VB47VB0 t+ i) 

(@t 


(42) 


VB 47 vr t + i ) 

VB2 t VB4 f V/ t+ i) 
VB2 t VB4 t V^+i) 


If we now add the unit clauses (A3 r ) and (B3 r ), the resulting set of 13 + 50r 
clauses in ll+ 12 r variables is readily satisfiable when r = 6 , thereby proving that 
the critical room might indeed be jointly occupied. (Incidentally, standard termi¬ 
nology for mutual exclusion protocols would say that “two threads concurrently 
execute a critical section” •, but we shall continue with our roommate metaphor.) 

Back at the drawing board, one idea is to modify ( 40 ) by letting Alice use 
the room only when l = 1, but letting Bob in when l = 0: 


AO. Maybe go to Al. 

Al. If l go to A2, else to Al. 
A2. Critical, go to A3. 

A3. Set / 0, go to AO. 


BO. Maybe go to Bl. 

Bl. If l go to Bl, else to B2. 
B2. Critical, go to B3. 

B3. Set Z 1 1, go to BO. 


(43) 




with r = 100 show that 
able; thus mutual exclusion is apparently guaranteed by ( 43 ). 


unsatisfi- 
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But ( 43 ) is a nonstarter, because it imposes an intolerable cost: Alice can’t 
use the room k times until Bob has already done so! Scrap that. 

How about installing another light, so that each person controls one of them? 


AO. Maybe go to Al. 

Al. If b go to Al, else to A2. 

A2. Set a I 1， go to A3. 

A3. Critical，go to A4. 

A4. Set a I 0, go to AO. 

No; this suffers from the same defect as 
the order of steps 1 and 2 : 


BO. Maybe go to Bl. 

Bl. If a go to Bl，else to B2. 

B2. Set 6 *<— 1, go to B3. ( 44 ) 

B3. Critical, go to B4. 

B4. Set 6 0, go to BO. 

( 40 ). But maybe we can cleverly switch 


deadlock 

reboot 

Dijkstra 

starvation 


AO. Maybe go to Al. 

Al. Set a 卜 1， go to A2. 

A2. If b go to A2, else to A3. 
A3. Critical, go to A4. 

A4. Set a 0, go to AO- 


BO. Maybe go to Bl. 

Bl. Set 6 I 1， go to B2. 

B2. If a go to B2, else to B3. ( 45 ) 

B3. Critical, go to B4. 

B4. Set 6 I 0, go to BO. 


Yes! Exercise 95 proves easily that this protocol does achieve mutual exclusion. 

Alas, however, a new problem now arises，namely the problem known as 
“deadlock” or “livelock •” Alice and Bob can get into states A2 and B2, after 
which they’re stuck 一 each waiting for the other to go critical. 

In such cases they could agree to “reboot” somehow. But that would be 
a cop-out; they really seek a better solution. And they aren’t alone: Many 
people have struggled with this surprisingly delicate problem over the years, and 
several solutions (both good and bad) appear in the exercises below. Edsger 
Dijkstra，in some pioneering lecture notes entitled Cooperating Sequential Pro¬ 
cesses [Technological University Eindhoven (September 1965) ， §2.1], thought of 
an instructive way to improve on ( 45 ): 

A0. Maybe go to Al. 

Al. Set a 卜 1, go to A2. 

A2. If b go to A3, else to A4. 

A3- Set a 1 0, go to Al. 

A4. Critical, go to A5. 

A5. Set a 0, go to A0. 

But he realized that this too is unsatisfactory, because it permits scenarios in 
which Alice ， say, might wait forever while Bob repeatedly uses the critical room. 
(Indeed, if Alice and Bob are in states Al and B2, she might go to A2, A3, 
then Al, thereby letting him run to B4, B5, B0, Bl, and B2; they’re back where 
they started，yet she’s made no progress.) 

The existence of this problem, called starvation, can also be detected via 
bounded model checking. The basic idea (see exercise 91) is that starvation 
occurs if and only if there is a loop of transitions 

JQ) —>• X\ 4 ... 4 X p Xp_(_i 4 ... 4 X r = X p (47) 

such that (i) Alice and Bob each are bumped at least once during the loop; and 
(ii) at least one of them is never in a “maybe” or “critical” state during the loop. 


B0. Maybe go to Bl. 

Bl. Set b <— 1^ go to B2. 

B2. If a go to B3, else to B4. 
B3. Set 6 I 0, go to Bl. 

B4. Critical, go to B5. 

B5. Set 6 1 0, go to B0. 


(46) 
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And those conditions are easily encoded into clauses, because we can identify 
the variables for time r with the variables for time p, and we can append the 
clauses 

(@^V @ P+ 1 V ... V @ r _i) A (@ p V @ P+ 1 V ... V @ r _i) (48) 

to guarantee (i). Condition (ii) is simply a matter of appending unit clauses; for 
example，to test whether Alice can be starved by ( 46 )，the relevant clauses are 
AOp A AOp+i 八 • • • 八 AO r _i A A 4 p A A 4 p+i 八 • • • 八 A 4 r _i. 

The deficiencies of ( 43 ) ，（ 45 )，and ( 46 ) can all be viewed as instances of 
starvation, because ( 47 ) and ( 48 ) are satisfiable (see exercise 90). Thus we 
can use bounded model checking to find counterexamples to any unsatisfactory 
protocol for mutual exclusion，either by exhibiting a scenario in which Alice and 
Bob are both in the critical room or by exhibiting a feasible starvation cycle ( 47 ). 

Of course we’d like to go the other way, too: If a protocol has no coun¬ 
terexamples for ， say, r = 100 , we still might not know that it is really reliable; 
a counterexample might exist only when r is extremely large. Fortunately there 
are ways to obtain decent upper bounds on r, so that bounded model checking 
can be used to prove correctness as well as to demonstrate incorrectness. For 
example，we can verify the simplest known correct solution to Alice and Bob’s 
problem，a protocol by G. L. Peterson [Information Proc. Letters 12 (1981) ， 115- 
116]，who noticed that a careful combination of ( 43 ) and ( 45 ) actually suffices: 


AO. Maybe go to Al. 

Al. Set a — 1， go to A2. 

A2. Set Z 0, go to A3. 

A3. If b go to A4, else to A5. 
A4. If l go to A5, else to A3- 
A5. Critical, go to A6. 

A6. Set a I 0, go to A0. 


B0. Maybe go to Bl. 

Bl. Set 6 i 1, go to B2. 

B2* Set Z 1， go to B3. 

B3. If a go to B4, else to B5- 
B4. IfZ go to B3, else to B5- 
B5. Critical, go to B6. 

B6. Set 6 0, go to B0. 


(49) 


Now there are three signal lights, a, 6 ， and l — one controlled by Alice, one 
controlled by Bob, and one switchable by both. 

To show that states A5 and B5 can’t be concurrent, we can observe that the 
shortest counterexample will not repeat any state twice; in other words, it will be 
a simple path of transitions ( 33 ). Thus we can assume that r is at most the total 
number of states. However, ( 49 ) has 7x7x2x2x2 = 392 states; that’s a finite 
bound, not really out of reach for a good SAT solver on this particular problem ， 
but we can do much better. For example ， it’s not hard to devise clauses that are 
satisfiable if and only if there’s a simple path of length < r (see exercise 92)，and 
in this particular case the longest simple path turns out to have only 54 steps. 

We can in fact do better yet by using the important notion of invariants, 
which we encountered in Section 1.2.1 and have seen repeatedly throughout this 
series of books. Invariant assertions are the key to most proofs of correctness, 
so it’s not surprising that they also give a significant boost to bounded model 
checking. Formally speaking, if is a Boolean function of the state vector X ， 
we say that $ is invariant if implies whenever X -> X 1 . For example ， 
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it’s not hard to see that the following clauses are invariant with respect to ( 49 ): 

$(X) = (A0VA1VA2VA3VA4VA5VA6) A (BOVB1 VB2VB3VB4VB5VB 6 ) 

A (A0Va)A(AlVa)A(AlVa)A(A3Va)A(AlVa)A(A5Va)A(A6Va) 

A (B0V6)A(BIV6)A(B2V6)A(B3V6)A(B4V6)A(B5V6)A(B6V6). ( 50 ) 

(The clause AO V a says that a = 0 when Alice is in state A0, etc.) And we can 
use a SAT solver to prove that $ is invariant，by showing that the clauses 

$(X) A (X X ') A ^(X 7 ) ( 51 ) 

are unsatisfiable. Furthermore $(X 0 ) holds for the initial state X 0 ，because 
-i$(X 0 ) is unsatisfiable. (See exercise 93.) Therefore is true for all t > 0, 

by induction, and we may add these helpful clauses to all of our formulas. 

The invariant ( 50 ) reduces the total number of states by a factor of 4. And 
the real clincher is the fact that the clauses 

(_Xq ~^ X \ ― y • 4 4 — y 八 $(_X"o) 八 伞 (义 i) 八...八 $(义广）八 A5 广八 B5^ , ( 52 ) 

where Xq is not required to be the initial state, turn out to be unsatisfiable 
when r = 3. In other words ， there’s no way to go back more than two steps 
from a bad state，without violating the invariant. We can conclude that mutual 
exclusion needs to be verified for ( 49 ) only by considering paths of length 2 (!). 
Furthermore，similar ideas (exercise 98) show that ( 49 ) is starvation-free. 

Caveat: Although ( 49 ) is a correct protocol for mutual exclusion according to 
Alice and Bob’s ground rules，it cannot be used safely on most modern computers 
unless special care is taken to synchronize cache memories and write buffers. The 
reason is that hardware designers use all sorts of trickery to gain speed, and those 
tricks might allow one process to see a = 0 at time t + 1 even though another 
process has set a ^ 1 at time t. We have developed the algorithms above 
by assuming a model of parallel computation that Leslie Lamport has called 

sequential consistency [IEEE Trans. C-28 (1979), 690-691]. 

Digital tomography. Another set of appealing questions amenable to SAT 
solving comes from the study of binary images for which partial information 
is given. Consider，for example, Fig. 36， which shows the “Cheshire cat” of 
Section 7.1.3 in a new light. This image is an m x n array of Boolean variables 
with m = 25 rows and n = 30 columns: The upper left corner element, 
: ri,i，is 0 , representing white; and = 1 corresponds to the lone black pixel 
in the top row. We are given the row sums — x iJ for 1 < i < m and 

the column sums Cj = Xi J ^ or 1 S S n ，as well as both sets of sums in 

the 45° diagonal directions, namely 

dd = ^2 and b d = ^2 Xi ,j for 0 < d < m + n. ( 53 ) 

i—j—d—n 

To what extent can such an image be reconstructed from its sums Cj, 
ad, and bd^ Small examples are often uniquely determined by these Xray-like 
projections (see exercise 103). But the discrete nature of pixel images makes 
the reconstruction problem considerably more difficult than the corresponding 
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b 54 = 0000000015464675466117677424389 


8 queens problem 
queens 
chessboard 
lexicographic order 


Fig- 36. An array of black and white pixels together with its 
row sums column sums Cj，and diagonal sums a 山 6 小 

continuous problem, in which projections from many different angles are avail¬ 
able. Notice，for example，that the classical a 8 queens problem” 一 to place eight 
nonattacking queens on a chessboard — is equivalent to solving an 8 x 8 digital 
tomography problem with the constraints T{ — 1 , Cj = 1 , < 1 , and bd < 1 . 

The constraints of Fig. 36 appear to be quite strict，so we might expect that 
most of the pixels Xij are determined uniquely by the given sums. For instance, 
the fact that ai = ••• = <25 = 0 tells us that Xij = 0 whenever i + j < 6 ; 
and similar deductions are possible at all four corners of the image. A crude 
“ballpark estimate” suggests that we’re given a few more than 150 sums，most 
of which occupy 5 bits each; hence we have roughly 150 x 5 = 750 bits of data, 
from which we wish to reconstruct 25 x 30 = 750 pixels Xij. Actually ， however, 
this problem turns out to have many billions of solutions (see Fig. 37)，most of 
which aren’t catlike! Exercise 106 provides a less crude estimate，which shows 
that this abundance of solutions isn’t really surprising. 



(a) lexicographically first; (b) maximally different; (c) lexicographically last. 


Fig. 37 - Extreme solutions to the constraints of Fig. 36- 
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A digital tomography problem such as Fig. 36 is readily represented as a 
sequence of clauses to be satisfied，because each of the individual requirements 
is just a special case of the cardinality constraints that weVe already considered 
in the clauses of ( 18 )—( 21 ). This problem differs from the other instances of SAT 
that we’ve been discussing, primarily because it consists entirely of cardinality 
constraints: It is a question of solving 25 + 30 + 54 + 54 = 163 simultaneous 
linear equations in 750 variables Xi ， j, where each variable must be either 0 or 1. 
So it’s essentially an instance of integer programming (IP)，not an instance of 
satisfiability (SAT). On the other hand, Bailleux and Boufkhad devised clauses 
( 20 ) and ( 21 ) precisely because they wanted to apply SAT solvers，not IP solvers ， 
to digital tomography. In the case of Fig. 36, their method yields approximately 
40,000 clauses in 9,000 variables，containing about 100,000 literals altogether. 

Figure 37(b) illustrates a solution that differs as much as possible from 
Fig. 36. Thus it minimizes the sum + $ 2,5 + $ 2,6 + ••• + ^ 25 , 2 i of the 
182 variables that correspond to black pixels，over all 0-or-1-valued solutions 
to the linear equations. If we use linear programming to minimize that sum 
over 0 < Xij < 1, without requiring the variables to be integers，we find almost 
instantly that the minimum value is ^ 31.38 under these relaxed conditions; 
hence every black-and-white image must have at least 32 black pixels in common 
with Fig. 36. Furthermore, Fig. 37(b) — which can be computed in a few seconds 
by widely available IP solvers such as CPLEX — actually achieves this minimum. 
By contrast, state-of-the-art SAT solvers as of 2013 had great difficulty finding 
such an image, even when told that a 32-in-common solution is possible. 

Parts (a) and (c) of Fig. 37 are ， similarly，quite relevant to the current state 
of the SAT-solving art: They represent hundreds of individual SAT instances, 
where the first k variables are set to particular known values and we try to 
find a solution with the next variable either 0 or 1 ， respectively. Several of the 
subproblems that arose while computing rows 6 and 7 of Fig. 37(c) turned out to 
be quite challenging, although resolvable in a few hours; and similar problems ， 
which correspond to different kinds of lexicographic order, apparently still lie 
beyond the reach of contemporary SAT-oriented methods. Yet IP solvers polish 
these problems off with ease. (See exercises 109 and 111.) 

If we provide more information about an image，our chances of being able 
to reconstruct it uniquely are naturally enhanced. For example，suppose we also 
compute the numbers r [’ a^，and b f d , which count the runs of Is that occur 
in each row ， column，and diagonal. (We have r[ = 1 ， = 2， = 4， and 
so on.) Given this additional data, we can show that Fig. 36 is the only solution ， 
because a suitable set of clauses turns out to be unsatisfiable. Exercise 117 
explains one way by which ( 20 ) and ( 21 ) can be modified so that they provide 
constraints based on the run counts. Furthermore, it isn’t difficult to express 
even more detailed constraints, such as the assertion that “column 4 contains 
runs of respective lengths ( 6 , 1 ， 3)，” as a sequence of clauses; see exercise 438. 

SAT examples —— summary. We’ve now seen convincing evidence that simple 
Boolean clauses — ANDs of ORs of literals — are enormously versatile. Among 
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other things, we’ve used them to encode problems of graph coloring，integer 
factorization, hardware fault testing, machine learning，model checking, and 
tomography. And indeed, Section 7.9 will demonstrate that 3SAT is the “poster 
child” for NP-complete problems in general: Any problem in NP — which is 
a huge class, essentially comprising all yes-or-no questions of size N whose 
affirmative answers are verifiable in N °^ steps — can be formulated as an 
equivalent instance of 3SAT, without greatly increasing the problem size. 

Backtracking for SAT. We’ve now seen a dizzying variety of intriguing and im¬ 
portant examples of SAT that are begging to be solved. How shall we solve them? 

Any instance of SAT that involves at least one variable can be solved sys¬ 
tematically by choosing a variable and setting it to 0 or 1. Either of those choices 
gives us a smaller instance of SAT; so we can continue until reaching either an 
empty instance — which is trivially satisfiable, because no clauses need to be 
satisfied — or an instance that contains an empty clause. In the latter case we 
must back up and reconsider one of our earlier choices, proceeding in the same 
fashion until we either succeed or exhaust all the possibilities. 

For example，consider again the formula F in (i). If we set x\ = 0, F reduces 
to X 2 八 ($2 VT 3 )，because the first clause (xi V x^) loses its while the last two 
clauses contain x\ and are satisfied. It will be convenient to have a notation for 
this reduced problem; so let’s write 

= X 2 A (x 2 Vx 3 ). ( 54 ) 

Similarly, if we set x\ — 1, we obtain the reduced problem 

F\X! = (x 2 Vx 3 )Ax 3 A(x 2 Vx 3 ). (55) 

F is satisfiable if and only if we can satisfy either ( 54 ) or ( 55 ). 

In general if F is any set of clauses and if l is any literal，then F 1 1 (read 
“F given Z” or “F conditioned on r) is the set of clauses obtained from F by 

• removing every clause that contains and 

• removing l from every clause that contains l. 

This conditioning operation is commutative, in the sense that F\l\V = F \ V \ l 
when l 1 / I. If 1/ = { “， • • • ， 1^} is any set of strictly distinct literals，we can also 
write F\L = F\li\^ - In these terms, F is satisfiable if and only if F | L = 0 
for some such L, because the literals of L satisfy every clause of F when F | L = 0. 

The systematic strategy for SAT that was sketched above can therefore be 
formulated as the following recursive procedure B(F), which returns the special 
value 丄 when F is unsatisfiable, otherwise it returns a set L that satisfies F: 

^ If F — 0, return 0. (F is trivially satisfiable.) 

Otherwise if e G F, return 丄 . (F is unsatisfiable.) 

B(F) = < Otherwise let l be a literal in F and set L i- B(F \ l). ( 56 ) 

If 1/ / 丄 ， return L\J l. Otherwise set L B(F \ l). 

、 If 1/ / 丄 ， return L\Jl. Otherwise return 丄 . 

Let’s try to flesh out this abstract algorithm by converting it to efficient 
code at a lower level. From our previous experience with backtracking, we know 
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that it will be crucial to have data structures that allow us to go quickly from 
F to F \ then back again to F if necessary，when F is a set of clauses and 
l is a literal. In particular, we’ll want a good way to find all of the clauses that 
contain a given literal. 

A combination of sequential and linked structures suggests itself for this 
purpose, based on our experience with exact cover problems: We can represent 
each clause as a set of cells, where each cell p contains a literal l = L (p) together 
with pointers F(p) and B(p) to other cells that contain /， in a doubly linked list. 
We^ll also need C(p)，the number of the clause to which p belongs. The cells of 
clause Ci will be in consecutive locations START (i) + jf，for 0 < j < SIZE(i). 

We will find it convenient to represent the literals Xk and Xk , which involve 
variable Xk, by using the integers 2k and 2k + 1. With this convention we have 

f = Z ©1 and |/| = $，》]_• ( 57 ) 

Our implementation of ( 56 ) will assume that the variables are X 2 , … ， x n ; 
thus the 2 n possible literals will be in the range 2 < / < 2 n + 1 . 

Cells 0 through 2n +1 are reserved for special purposes: Cell l is the head of 
the list for the occurrences of l in other cells. Furthermore C(D will be the length 
of that list，namely the number of currently active clauses in which l appears. 

For example, the m = 7 ternary clauses R 1 of ( 7 ) might be represented 
internally in 2n + 2 + 3m = 31 cells as follows，using these conventions: 

p = 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 

L(p)= - 973875653843862964742 

F(p)=--30 21 29 17 26 28 22 25 9 7 3 8 11 5 6 15 12 13 4 18 19 16 2 10 23 20 14 27 24 

B(p) = - - 24 12 20 15 16 11 13 10 25 14 18 19 28 17 23 5 21 22 27 3 8 26 30 9 6 29 7 4 2 

C(p)=-- 23323332777666555444333222111 

The literals of each clause appear in decreasing order here; for example, the 
literals L(p) = ( 8 , 4, 3) in cells 19 through 21 represent the clause ❹ V 奶 V xi ， 
which appears as the fourth clause, c 4l2’ in ( 7 ). This ordering turns out to be 
quite useful，because we’ll always choose the smallest unset variable as the l or l 
in ( 56 ); then l or I will always appear at the right of its clauses，and we can 
remove it or put it back by simply changing the relevant SIZE fields. 

The clauses in this example have START ⑴ = 31 — for 1 < i < 7, and 
SIZE ⑴ = 3 when computation begins. 

Algorithm A (Satisfiability by backtracking). Given nonempty clauses Ci A- • -A 
C m on n > 0 Boolean variables x\ • • .x n , represented as above，this algorithm 
finds a solution if and only if the clauses are satisfiable. It records its current 
progress in an array mi … m n of “moves，” whose significance is explained below. 

Al. [Initialize.] Set atm and d 1. (Here a represents the number of active 
clauses，and d represents the depth-plus-one in an implicit search tree.) 

A2. [Choose.] Set l •<— 2d. If C(0 < C(l + 1), set / + 1. Then set rrid -f- 

(/ & 1) + 4[CG ㊉ 1) = 0]. (See below.) Terminate successfully if C(l) = a. 

A3. [Remove L] Delete I from all active clauses; but go to A5 if that would make 
a clause empty. (We want to ignore 『， because we’re making l true.) 
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Fig. 38. The search tree that is implicitly traversed by Algorithm A, when 
that algorithm is applied to the eight unsatisfiable clauses R defined in (6). 
Branch nodes are labeled with the variable being tested; leaf nodes are labeled 
with a clause that is found to be contradicted. 

A4. [Deactivate Vs clauses.] Suppress all clauses that contain 1. (Those clauses 
are now satisfied.) Then set a a — C(D ， d 卜 d + 1， and return to A2. 

A5. [Try again.] If rrid < 2, set rrid I 3 — rrid, l t 2d+ (mj & 1)，and go to A3. 

A 6 . [Backtrack.] Terminate unsuccessfully if d = 1 (the clauses are unsatisfi¬ 
able). Otherwise set d ^ d — 1 and l 2d + (mj & 1). 

A7. [Reactivate Vs clauses.] Set a 4- a + C(D，and unsuppress all clauses that 
contain l. (Those clauses are now unsatisfied, because l is no longer true.) 

A 8 . [Unremove L] Reinstate I in all the active clauses that contain it. Then go 
back to A5. | 

(See exercise 121 for details of the low-level list processing operations that are 
needed to update the data structures in steps A3 and A4, and to downdate them 
in A7 and A 8 .) 

The move codes rrij of Algorithm A are integers between 0 and 5 that encode 
the state of the algorithm’s progress as follows: 

• rrij = 0 means we’re trying Xj = 1 and haven’t yet tried Xj = 0 . 

• rrij = 1 means we’re trying Xj = 0 and haven’t yet tried Xj = 1 . 

• rrij = 2 means we’re trying Xj = 1 after Xj = 0 has failed. 

• rrij = 3 means we’re trying Xj = 0 after Xj = 1 has failed. 

• rrij = 4 means we’re trying Xj = 1 when Xj doesn’t appear. 

• rrij = 5 means we’re trying Xj = 0 when Xj doesn’t appear. 

Codes 4 and 5 refer to so-called “pure literals” ： If no clause contains the literal 『， 
we can’t go wrong by assuming that l is true. 

For example，when Algorithm A is presented with the clauses ( 7 )，it cruises 
directly to a solution by setting mi m 2777 . 3777.4 = 1014; the solution is X 1 X 2 XSX 4 = 
0101. But when the unsatisfiable clauses ( 6 ) are given，the successive code strings 
mi … rrid in step A2 are 

1,11,110,1131,121,1211,1221,21,211,2111,2121,221,2221, ( 58 ) 

before the algorithm gives up. (See Fig. 38.) 
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It’s helpful to display the current string mi .. .rrid now and then，as a 
convenient indication of progress; this string increases lexicographically. Indeed ， 
fascinating patterns appear as the 2s and 3s gradually move to the left. (Try it!) 

When the algorithm terminates successfully in step A2, a satisfying assign¬ 
ment can be read off from the move table by setting 4- 1 ㊉ (rrij & 1) for 
I < j < d. Algorithm A stops after finding a single solution; see exercise 122 if 
you want them all. 

Lazy data structures. Instead of using the elaborate doubly linked machinery 
that underlies Algorithm A, we can actually get by with a much simpler scheme 
discovered by Cynthia A. Brown and Paul W. Purdom ， Jr. [IEEE Trans. PAMI- 
4 (1982) ， 309—316]，who introduced the notion of watched literals. They observed 
that we don’t really need to know all of the clauses that contain a given literal ， 
because only one literal per clause is actually relevant at any particular time. 

Here’s the idea: When we work on clauses F |L, the variables that occur in L 
have known values，but the other variables do not. For example，in Algorithm A ， 
variable Xj is implicitly known to be either true or false when j < but its value 
is unknown when j > d. Such a situation is called a partial assignment. A partial 
assignment is consistent with a set of clauses if no clause consists entirely of 
false literals. Algorithms for SAT usually deal exclusively with consistent partial 
assignments; the goal is to convert them to consistent total assignments, by 
gradually eliminating the unknown values. 

Thus every clause in a consistent partial assignment has at least one nonfalse 
literal; and we can assume that such a literal appears first，when the clause is 
represented in memory. Many nonfalse literals might be present, but only one of 
them is designated as the clause’s a watchee •” When a watched literal becomes 
false, we can find another nonfalse literal to swap into its place — unless the 
clause has been reduced to a unit, a clause of size 1. 

With such a scheme we need only maintain a relatively short list for every 
literal /， namely a list W/ of all clauses that currently watch l. This list can 
be singly linked. Hence we need only one link per clause; and we have a total 
of only 2n + m links altogether, instead of the two links for each cell that are 
required by Algorithm A. 

Furthermore —— and this is the best part! —— no updates need to be made 
to the watch lists when backtracking. The backtrack operations never falsify 
a nonfalse literal，because they only change values from known to unknown. 
Perhaps for this reason, data structures based on watched literals are called lazy, 
in contrast with the “eager” data structures of Algorithm A. 

Let us therefore redesign Algorithm A and make it more laid-back. Our 
new data structure for each cell p has only one field, L(p); the other fields F(p), 
B(p )， C(p) are no longer necessary，nor do we need 2n + 2 special cells. As 
before we will represent clauses sequentially，with the literals of Cj beginning at 
START (j) for I < j <m. The watched literal will be the one in START (j); and a 
new field ， LINK(jf)，will be the number of another clause with the same watched 
literal (or 0, if Cj is the last such clause). Moreover, our new algorithm won’t 
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need SIZE(j). Instead，we can assume that the final literal of Cj is in location 
START (j — 1) — 1，provided that we define START (0) appropriately. 

The resulting procedure is almost unbelievably short and sweet. It’s surely 
the simplest SAT solver that can claim to be efficient on problems of modest size: 

Algorithm B (Satisfiability by watching). Given nonempty clauses Ci A- • • f\C m 
on n > 0 Boolean variables x\ .. .x n ^ represented as above, this algorithm finds 
a solution if and only if the clauses are satisfiable. It records its current progress 
in an array mi .. .m n of “moves,” whose significance was explained above. 

Bl. [Initialize.] Set d <— 1. 

B2. [Rejoice or choose.] If d > n, terminate successfully. Otherwise set rrid ^ 
[W 2 d = 0 or W 2 d-\-i 7^ 0] and l t 2d rrid- 

B3. [Remove l if possible.] For all j such that I is watched in Cj, watch another 
literal of Cj. But go to B5 if that can’t be done. (See exercise 124.) 

B4. [Advance.] Set Wf l 0, d t d + 1 ， and return to B2. 

B5. [Try again.] If rrid < 2, set rrid ^ 3 — rrid ，/ ^ 2d + (md & 1)，and go to B3. 

B6. [Backtrack.] Terminate unsuccessfully if d = 1 (the clauses are unsatisfi- 
able). Otherwise set d ^ d — 1 and go back to B5. | 

Readers are strongly encouraged to work exercise 124， which spells out the 
low-level operations that are needed in step B3. Those operations accomplish 
essentially everything that Algorithm B needs to do. 

This algorithm doesn’t use move codes 4 or 5, because lazy data structures 
don’t have enough information to identify pure literals. Fortunately pure literals 
are comparatively unimportant in practice; problems that are helped by the pure 
literal shortcut can usually also be solved quickly without it. 

Notice that steps A2 and B2 use different criteria for deciding whether to 
try Xd = 1 or Xd = 0 first at each branch of the search tree. Algorithm A chooses 
the alternative that satisfies the most clauses; Algorithm B chooses to make l 
true instead of I if the watch list for l is empty but the watch list for l is not. 
(All clauses in which I is watched will have to change, but those containing l 
are satisfied and in good shape.) In case of a tie，both algorithms set l 1 ， 
which corresponds to Xd = 0. The reason is that human-designed instances of 
SAT tend to have solutions made up of mostly false literals. 

Forced moves from unit clauses. The simple logic of Algorithm B works 
well on many problems that aren’t too large. But its insistence on setting x\ 
first，then X 2 , etc., makes it quite inefficient on many other problems, because 
it fails to take advantage of unit clauses. A unit clause (l) forces l to be true; 
therefore two-way branching is unnecessary whenever a unit clause is present. 
Furthermore, unit clauses aren’t rare: Far from it. Experience shows that they’re 
almost ubiquitous in practice, so that the actual search trees often involve only 
dozens of branch nodes instead of thousands or millions. 

The importance of unit clauses was recognized already in the first computer 
implementation of a SAT solver，designed by Martin Davis, George Logemann ， 
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and Donald Loveland [CACM 5 (1962) ， 394-397] and based on ideas that Davis 
had developed earlier with Hilary Putnam [JACM 7 (1960), 201-215]. They 

extended Algorithm A by introducing mechanisms that recognize when the size 
of a clause decreases to 1 ， or when the number of unsatisfied clauses containing 
a literal drops to 0. In such cases, they put variables onto a “ready list，” and 
assigned those variables to fixed values before doing any further two-way branch¬ 
ing. The resulting program was fairly complex; indeed, computer memory was 
so limited in those days，they implemented branching by writing all the data for 
the current node of the search tree onto magnetic tape, then backtracking when 
necessary by restoring the data from the most recently written tape records! The 
names of these four authors are now enshrined in the term “DPLL algorithm,” 
which refers generally to SAT solving via partial assignment and backtracking. 

Brown and Purdom，in the paper cited earlier，showed that unit clauses 
can be detected more simply by using watched literals as in Algorithm B. We 
can supplement the data structures of that algorithm by introducing indices 
hi .. .h n so that the variable whose value is being set at depth d is Xh d instead 
of Xd- Furthermore we can arrange the not-yet-set variables whose watch lists 
aren’t empty into a circular list called the “active ring”; the idea is to proceed 
through the active ring，checking to see whether any of its variables are currently 
in a unit clause. We resort to two-way branching only if we go all around the 
ring without finding any such units. 

For example 5 let’s consider the 32 unsatisfiable clauses of waerden(3^3] 9) 
in ( 9 ). The active ring is initially (12 345 6 7), because 8 , 8 , 9, and 9 aren’t 
being watched anywhere. There are no unit clauses yet. The algorithm below will 
decide to try I first; then it will change the clauses 123, 135, 147, and 159 to 213, 
315, 417, and 519, respectively，so that nobody watches the false literal 1. The 
active ring becomes (2 345 6 7) and the next choice is 2; so 213, 234, 246, and 258 
morph respectively into 312, 324, 426, 528. Now, with active ring (345 6 7), the 
unit clause ‘3’ is detected (because 1 and 2 are false in C 31T). This precipitates 
further changes，and the first steps of the computation can be summarized thus: 
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Choice Changed clauses 

1 213,315,417,519 

2 312,324,426,528 

3 435,534,639 

4 624,714,546,648 

6 936,768 



867,879 



5, 5 Backtrack 

4 534,546,648 

5 5 456, 825,915,657,759 


When 6 is found, 7 is also a unit clause; but the algorithm doesn’t see it yet, 
because variable xq is tested first. The active ring changes first to (7 5) after 6 
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Fig. 39. The search tree that is implicitly traversed by Algorithm D, when 
that algorithm is applied to the eight unsatisfiable clauses R defined in (6). 
Branch nodes are labeled with the variable being tested; leaf nodes are labeled 
with a clause that is found to be contradicted. When the right child of a 
branch node is a leaf, the left branch was forced by a conditional unary clause. 


is found, because 5 is cyclically after 6; we want to look at 7 before 5, instead of 
revisiting more or less the same clauses. After 6 has been chosen, 9 is inserted at 
the left, because the watch list for 9 becomes nonempty. After backtracking, vari¬ 
ables 8, 7, 9, 6 are successively inserted at the left as they lose their forced values. 

The following algorithm represents the active ring by giving a NEXT field to 
each variable, with x N ext(/c) the successor of The ring is accessed via “head” 
and “tail” pointers h and t at the left and right，with h = NEXT(t). If the ring 
is empty, however， 6 = 0, and h is undefined. 


Algorithm D (Satisfiability by cyclic DPLL). Given nonempty clauses Ci A - • - A 
C m on n > 0 Boolean variables x\ • • • x n , represented with lazy data structures 
and an active ring as explained above, this algorithm finds a solution if and only 
if the clauses are satisfiable. It records its current progress in an array hi .. .h n of 
indices and an array mo … m n of “moves，” whose significance is explained below. 

Dl. [Initialize.] Set mo and do the following for A: = n，n — 1 ， 

• • • ， 1: Set Xk < - 1 (denoting an unset value); if W^k / 0 or W 2 /C +1 / 0, 

set NEXT(fc) /i, /i •<— fc, and if t = 0 also set t ^ k. Finally, if t 7 ^ 0, 
complete the active ring by setting NEXT(t) ^ h. 

D2. [Success?] Terminate if t = 0 (all clauses are satisfied). Otherwise set k ^ t. 

D3. [Look for unit clauses.] Set h ^ NEXT (AO and use the subroutine in exer¬ 
cise 129 to compute / [2h is a unit] + 2[2h + 1 is a unit]. If / = 3, go 

to D7. If / = 1 or 2, set rrid+i 卜 / + 3, 1 1 众 ， and go to D5. Otherwise，if 
h ^ set k <- h and repeat this step. 


D4. [Two-way branch.] Set h NEXT(t) and rrid+i ^ [W 2/1 = 0 or W 2 / 1+1 / 0]. 

D5. [Move on.] Set d ^ d+1^ hd k h. lft = k^ set t ^ 0; otherwise delete 
variable k from the ring by setting NEXT(t) ^ h NEXT (AO. 

D6. [Update watches.] Set b ^ (m^ + 1) mod 2, Xk ^ and clear the watch list 
for Xk (see exercise 130). Return to D2. 


September 23, 2015 
















34 


COMBINATORIAL ALGORITHMS (F6A: 23 Sep 2015 @ 1959) 


7 . 2 . 2.2 


D7. [Backtrack.] Set t i- k. While rrid > 2, set k ^ hd^ Xk < - 1; if / 0 or 

W 2 k +1 7 ^ o, set NEXT(fc) t h, h t k, NEXT(t) h; and setd^-d-1. 

D8. [Failure?] If d > 0, set rrid I 3 — k ^ hd^ and return to D 6 . Otherwise 
terminate the algorithm (because the clauses aren’t satisfiable). | 

The move codes of this algorithm are slightly different from the earlier ones: 


rrij = 0 means we’re trying 
rrij = 1 means we’re trying 
rrij = 2 means we’re trying Xhj 
rrij = 3 means we’re trying 
rrij — 4 means we’re trying 
— 5 means we’re trying xj^ 


1 and haven’t yet tried Xh 5 = 0 . 

0 and haven’t yet tried — 1 . 

1 after x\ lj — 0 has failed. 

0 after Xhj = 1 has failed. 

1 because it’s forced by a unit clause 
0 because it’s forced by a unit clause 


As before, the number of two-way branch nodes in the implicit search tree is the 
number of times that rrij is set to 0 or 1 . 
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Comparison of the algorithms. OK, we’ve just seen three rudimentary SAT 
solvers. How well do they actually do? Detailed performance statistics will be 
given later in this section, after we’ve studied several more algorithms. But a 
brief quantitative study of Algorithms A ， B, and D now will give us some concrete 
facts with which we can calibrate our expectations before moving on. 

Consider, for example, langford(n), the problem of Langford pairs. This 
problem is typical of SAT instances where many unit clauses arise during the 
computation. For example，when Algorithm D is applied to langford (5), it 
reaches a stage where the move codes are 


mim 2 ...m d = 1255555555555555114545545, ( 6 o) 

indicating only four two-way branches (the Is and the 2 ) amongst a sea of forced 
moves. We therefore expect Algorithm D to outperform Algorithms A and B ， 
which don’t capitalize on unit clauses. 

Sure enough, Algorithm D wins (slightly), even on a small example such as 
langford (b), which has 213 clauses, 480 cells, 28 variables. The detailed stats are 

Algorithm A: 5379 + 108952 mems, 10552 bytes, 705 nodes. 
Algorithm B: 1206 + 30789 mems, 4320 bytes, 771 nodes. 

Algorithm D: 1417 + 28372 mems, 4589 bytes, 11 nodes. 

(Here “5379 + 108952 mems” means that 5379 memory accesses were made while 
initializing the data structures before the algorithm began; then the algorithm 
itself accessed octabytes of memory 108,952 times.) Notice that Algorithm B 
is more than thrice as fast as Algorithm A in this example，although it makes 
771 two-way branches instead of 705. Algorithm A needs fewer nodes, because 
it recognizes pure literals; but Algorithm B does much less work per node. 
Algorithm D，on the other hand, works very hard at each node, yet comes out 
ahead because its decision-making choices reduce the search to only a few nodes. 
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These differences become more dramatic when we consider larger problems. 
For instance, langford (9) has 1722 clauses，3702 cells, 104 variables, and we find 

Algorithm A: 332.0 megamems, 77216 bytes, 1,405,230 nodes. 
Algorithm B: 53.4 megamems，31104 bytes, 1,654,352 nodes. 
Algorithm D: 23.4 megamems，32057 bytes, 6093 nodes. 

And with langford(13)’s 5875 clauses, 12356 cells, 228 variables, the results are 

Algorithm A: 2699.1 gigamems ， 253.9 kilobytes, 8.7 giganodes. 
Algorithm B: 305.2 gigamems ， 101.9 kilobytes, 10.6 giganodes. 
Algorithm D: 71.7 gigamems ， 104.0 kilobytes, 14.0 meganodes. 

Mathematicians will recall that, at the beginning of Chapter 7， we used 
elementary reasoning to prove the unsatisfiability of langford (4k + 1) for all k. 
Evidently SAT solvers have great difficulty discovering this fact, even when k is 
fairly small. We are using that problem here as a benchmark test, not because we 
recommend replacing mathematics by brute force! Its unsatisfiability actually 
enhances its utility as a benchmark, because algorithms for satisfiability are more 
easily compared with respect to unsatisfiable instances: Extreme variations in 
performance occur when clauses are satisfiable，because solutions can be found 
purely by luck. Still, we might as well see what happens when our three algo¬ 
rithms are set loose on the satisfiable problem langford (16)，which turns out to be 
“no sweat.” Its 11494 clauses, 23948 cells, and 352 variables lead to the statistics 

Algorithm A: 11262.6 megamems ， 489.2 kilobytes, 28.8 meganodes. 
Algorithm B: 932.1 megamems ， 196.2 kilobytes, 40.9 meganodes. 
Algorithm D: 4.9 megamems ， 199.4 kilobytes, 167 nodes. 

Algorithm D is certainly our favorite so far，based on the langford data. But 
it is far from a panacea, because it loses badly to the lightweight Algorithm B 
on other problems. For example, the 2779 unsatisfiable clauses, 11662 cells, and 
97 variables of waerden(3^ 10; 97) yield 

Algorithm A: 150.9 gigamems ， 212.8 kilobytes, 106.7114 meganodes. 
Algorithm B: 6.2 gigamems ， 71.2 kilobytes, 106.7116 meganodes. 
Algorithm D: 1430.4 gigamems ， 72.1 kilobytes, 102.7 meganodes. 

And waerden(3^ 10; 96)’s 2721 satisfiable clauses，11418 cells，96 variables give us 

Algorithm A: 96.9 megamems, 208.3 kilobytes, 72.9 kilonodes. 

Algorithm B: 12.4 megamems, 69.8 kilobytes, 207.7 kilonodes. 

Algorithm D: 57962.8 megamems ， 70.6 kilobytes, 4447.7 kilonodes. 

In such cases unit clauses don’t reduce the search tree size by very much，so we 
aren’t justified in spending so much time per node. 

* Speeding up by working harder. Algorithms A, B ， and D are OK on smallish 
problems，but they cannot really cope with the larger instances of SAT that have 
arisen in our examples. Significant enhancements are possible if we are willing 
to do more work and to develop more elaborate algorithms. 

Mathematicians generally strive for nice ， short，elegant proofs of theorems; 
and computer scientists generally aim for nice, short, elegant sequences of steps 
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with which a problem can quickly be solved. But some theorems have no short 
proofs，and some problems cannot be solved efficiently with short programs. 

Let us therefore adopt a new attitude, at least temporarily，by fearlessly 
deciding to throw lots of code at SAT: Let’s look at the bottlenecks that hinder 
Algorithm D on large problems, and let’s try to devise new methods that will 
streamline the calculations even though the resulting program might be ten times 
larger. In this subsection we shall examine an advanced SAT solver，Algorithm L ， 
which is able to outperform Algorithm D by many orders of magnitude on many 
important problems. This algorithm cannot be described in just a few lines; 
but it does consist of cooperating procedures that are individually nice ， short ， 
elegant, and understandable by themselves. 

The first important ingredient of Algorithm L is an improved mechanism 
for unit propagation. Algorithm D needs only a few lines of code in step D3 to 
discover whether or not the value of an unknown variable has been forced by 
previous assignments; but that mechanism isn’t particularly fast, because it is 
based on indirect inferences from a lazy data structure. We can do better by 
using “eager” data structures that are specifically designed to recognize forced 
values quickly，because high-speed propagation of the consequences of a newly 
asserted value turns out to be extremely important in practice. 

A literal l is forced true when it appears in a clause C whose other literals 
have become false，namely when the set of currently assigned literals L has re¬ 
duced C to the unit clause C\L — (l). Such unit clauses arise from the reduction 
of binary clauses. Algorithm L therefore keeps track of the binary clauses (u\/v) 
that are relevant to the current subproblem F \ L. This information is kept 
in a so-called “bimp table” BIMP(D for every literal which is a list of other 
literals l 1 whose truth is implied by the truth of 1. Indeed，instead of simply 
including binary clauses within the whole list of given clauses, as Algorithms A ， 
B，and D do, Algorithm L stores the relevant facts about (u\/ v) directly，in a 
ready-to-use way, by listing u in BIMP(^) and v in BIMP(u). Each of the 2n 
tables BIMP(0 is represented internally as a sequential list of length BSIZE(D ， 
with memory allocated dynamically via the buddy system (see exercise 134). 

Binary clauses, in turn，are spawned by ternary clauses. For simplicity, 
Algorithm L assumes that all clauses have length 3 or less，because every instance 
of general SAT can readily be converted to 3SAT form (see exercise 28). And for 
speed，Algorithm L represents the ternary clauses by means of “timp tables,” 
which are analogous to the bimp tables: Every literal l has a sequential list 
TIMP(D of length TSIZE(D ， consisting of pairs pi = (^i^i), P 2 = ( 以2，仍）， …， 
such that the truth of l implies that each (ui V Vi) becomes a relevant binary 
clause. If (u V v V ti;) is a ternary clause，there will be three pairs p = (v^w)^ 
p 1 = (w ， u ) ， and p n = (u ， v), appearing in the respective lists TIMP(iZ) ， TIMP(U )， 
and TIMP(tD). Moreover，these three pairs are linked together cyclically, with 

LINK(p) =p', LINK(p 7 ) =p", LINK(p ,/ ) = p. ( 61 ) 

Memory is allocated for the timp tables once and for all, as the clauses are input, 
because Algorithm L does not generate new ternaries during its computations. 
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Individual pairs p are, however, swapped around within these sequential tables, 
so that the currently active ternary clauses containing u always appear in the 
first TSIZE(u) positions that have been allocated to TIMP(u). 

For example, let’s consider again the ternary clauses ( 9 ) of waerden(3^ 3; 9). 
Initially there are no binary clauses, so all BIMP tables are empty. Each of the 
ternary clauses appears in three of the TIMP tables. At level 0 of the search 
tree we might decide that =0; then TIMP (5) tells us that we gain eight 
binary clauses, namely {13, 19, 28, 34, 37,46,67, 79}. These new binary clauses 
are represented by sixteen entries in BIMP tables; BIMP (3), for instance, will now 
be {1 ， 4,7}. Furthermore ， we’ll want all of the TIMP pairs that involve either 
5 or 5 to become inactive，because the ternary clauses that contain 5 are weaker 
than the new binary clauses，and the ternary clauses that contain 5 are now 
satisfied. (See exercise 136.) 

As in ( 57 ) above, we shall assume that the variables of a given formula are 
numbered from 1 to n，and we represent the literals k and k internally by the 
numbers 2k and 2k+l. Algorithm L introduces a new twist, however，by allowing 
variables to have many different degrees of truth [see M. Heule ， M. Dufour，JL van 

Zwieten, and H. van Maaren, LNCS 3542 (2005), 345-359]: We say that Xk is 
true with degree D if VAL [fc] = D, and false with degree D if VAL [fc] = D + 1, 
where D is any even number. 

The highest possible degree, typically 2 32 — 2 inside a computer, is called 
RT for “real truth.^ The next highest degree, typically 2 32 — 4, is called NT for 
“near truth” ； and then comes PT = 2 32 — 6 ， “proto truth.^ Lower degrees PT — 2, 
PT — 4, •…， 2 also turn out to be useful. A literal l is said to be fixed in context 
T if and only if VAL [|/|] > T; it is fixed true if we also have VAL [|/|] & 1 = / & 1, 
and it is fixed false if its complement l is fixed true. 

Suppose, for example, that VAL [2] = RT + 1 and VAL [7] = PT; hence X 2 is 
“really false” while is “proto true •” Then the literal c 7\ represented internally 
by l = 14, is fixed true in context PT ， but l is not fixed in contexts NT or RT. The 
literal c 2\ represented internally by Z = 5, is fixed true in every context. 

Algorithm L uses a sequential stack i? 0 ， i^，• • • ， to record the names of 
literals that have received values. The current stack size, satisfies Q < E < n. 
With those data structures we can use a simple breadth - first search procedure 
to propagate the binary consequences of a literal l in context T at high speed: 

Set H ^ E] take account of l] 

while H < E, set l i- Rh, H t H 1, and ( 62 ) 

take account of V for all l 1 in BIMP(0. 

Here “take account of r means u if l is fixed true in context T，do nothing; if l is 
fixed false in context T, go to step CONFLICT; otherwise set VAL [|/|] 4 - T +{7&1), 
Re ^ and E E + 1.” The step called CONFLICT is changeable. 

A literal’s BIMP table might grow repeatedly as computation proceeds. But 
we can undo the consequences of bad decisions by simply resetting BSIZE(D 
to the value that it had before those decisions were made. A special variable 
ISTAMP is increased whenever we begin a new round of decision - making，and each 
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literal l has its private stamp IST(0. Whenever BSIZE(D is about to increase ， 
we check if IST(D = ISTAMP. If not，we set 

IST(0 ISTAMP, ISTACK[I] (/,BSIZE(0), / 4 - / + 1. ( 63 ) 

Then the entries on ISTACK make it easy to downdate the BIMP tables when we 
backtrack. (See step L13 in the algorithm below.) 

We’re ready now to look at the detailed steps of Algorithm L, except that one 
more member of its arsenal of data structures needs to be introduced: There’s 
an array VAR, which contains a permutation of {1，• • • ， n}，with VAR [A:] = x if 
and only if INXDr] = k. Furthermore VAR [A:] is a “free variable” 一 not fixed 
in context RT — if and only if 0 < A; < This setup makes it convenient to 
keep track of the variables that are currently free: A variable becomes fixed by 
swapping it to the end of the free list and decreasing N (see exercise 137); then 
we can free it later by simply increasing N, without swapping. 

Algorithm L (Satisfiability by DPLL with lookahead). Given nonempty clauses 
(7i 八…八 C m of size < 3 5 on n > 0 Boolean variables x\ • • .x n , this algorithm 
finds a solution if and only if the clauses are satisfiable. Its family of cooperating 
data structures is discussed in the text. 

LI. [Initialize.] Record all binary clauses in the BIMP array and all ternary 
clauses in the TIMP array. Let U be the number of distinct variables in unit 
clauses; terminate unsuccessfully if two unit clauses contradict each other ， 
otherwise record all distinct unit literals in FORCE [W for 0 < k < U. Set 
VAR[fc] •<— fc + 1 and INX [k + 1] fc for 0 < fc < n; and d •<— F 
ISTAMP •(- 0. (Think d = depth, F = fixed variables, I = ISTACK size.) 

L2. [New node.] Set BRANCH [d] i - 1. If U = 0, invoke Algorithm X below 

(which looks ahead for simplifications and also gathers data about how to 
make the next branch). Terminate happily if Algorithm X finds all clauses 
satisfied; go to L15 if Algorithm X discovers a conflict; go to L5 if U > 0. 

L3. [Choose L] Select a literal l that’s desirable for branching (see exercise 
168). If Z = 0, set d 4- d + 1 and return to L2. Otherwise set DEC W] 4- Z ， 
BACKFW] — F, BACKI W] — J, and BRANCH [d] ^ 0. 

L4. [Try l] Set U i-1, FORCE [0] 4 - 1. 

L5. [Accept near truths.] Set T ^ NT, G ^ .E ^ F, ISTAMP 4 - ISTAMP + 1, 
and CONFLICT 卜 Lll. Perform the binary propagation routine ( 62 ) for 
l <- FORCE [0]FORCE [/7 - 1]; then set U i-0. 

L 6 . [Choose a nearly true L] (At this point the stacked literals Rk are “really 
true” ior 0 <k < and “nearly true” ior G <k < E. We want them all 
to be really true.) If G = go to L10. Otherwise set L ^ Rq, G t G + l. 

L7. [Promote L to real truth.] Set X \L\ and VAL[X] ^ RT + L&l. Remove 
variable X from the free list and from all TIMP pairs (see exercise 137). Do 
step L 8 for all pairs (u^v) in TIMP(L), then return to L 6 . 

L 8 . [Consider u V v] (We have deduced that u or v must be true; five cases 
arise.) If either u or r is fixed true (in context T，which equals NT)，do 
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nothing. If both u and v are fixed false, go to CONFLICT. If u is fixed false 
but v isn’t fixed, perform ( 62 ) with l ^ v. If v is fixed false but u isn’t 
fixed, perform ( 62 ) with l i- u. If neither u nor v is fixed，do step L9. 

L9. [Exploit u V v] If v G BIMP(iZ)，perform ( 62 ) with l i- u (because u 
implies both v and v). Otherwise if G BIMP(u)，do nothing (because we 
already have the clause u\/v). Otherwise if iZ G BIMP(v) ^ perform ( 62 ) with 
l ^r- v. Otherwise append v to BIMP(u) and u to BIMP(^). (Each change 
to BIMP means that ( 63 ) might be invoked. Exercise 139 explains how to 
improve this step by deducing further implications called “compensation 
resolvents .”） 

L10. [Accept real truths.] Set F <— E. If BRANCH [d] > 0, set d •<— + 1 and go 

to L2. Otherwise go to L3 if d > 0, to L2 if d = 0. 

Lll. [Unfix near truths.] While E > G, set E <— E — 1 and VAL[|J?£；|] 0. 

LI2. [Unfix real truths.] While 五〉 F, do the following: Set E <— E — 1 and 

X \Re\] reactivate the TIMP pairs that involve X and restore X to the 
free list (see exercise 137); set VAL[X] 卜 0. 

L13. [Downdate BIMPs.] If BRANCH [d] >0, do the following while I > BACKI W]: 
Set I ^ I -1 and BSIZE(0 5 , where ISTACK[/] = 

L14. [Try again?] (We’ve discovered that DEC W] doesn’t work.) If BRANCH [d]= 
0, set Z f DEC Id ], DEC W] ^ l BRANCH [d] f 1， and go back to L4. 

L15. [Backtrack.] Terminate unsuccessfully if d = 0. Otherwise set d d — 1 ， 
五 •<— F, F •<— BACKF [d], and return to L12. | 

Exercise 143 extends this algorithm so that it will handle clauses of arbitrary size. 

* Speeding up by looking ahead. Algorithm L as it stands is incomplete, 
because step L2 relies on an as-yet-unspecified “Algorithm X” before choosing a 
literal for branching. If we use the simplest possible Algorithm X ， by branching 
on whatever literal happens to be first in the current list of free variables, the 
streamlined methods for propagating forced moves in ( 62 ) and ( 63 ) will tend to 
make Algorithm L run roughly three times as fast as Algorithm D, and that isn’t 
a negligible improvement. But with a sophisticated Algorithm X we can often 
gain another factor of 10 or more in speed, on significant problems. 

For example，here are some typical empirical statistics: 


Problem 

waerden(3^ 10; 97) 
langford (13) 
rand (3, 420,100, 0) 


Algorithm D 
1430 gigamems ， 
103 meganodes 
71.7 gigamems ， 
14.0 meganodes 
184 megamems ， 

34 kilonodes 


Algorithm L° 
391 gigamems ， 

31 meganodes 
21.5 gigamems ， 
10.9 meganodes 
34 megamems ， 
7489 nodes 


Algorithm L+ 
772 megamems ， 
4672 nodes 
45.7 gigamems ， 
944 kilonodes 
626 kilomems ， 
19 nodes 


compensation resolvents 

langford 

waerden 

randj- 


Here Algorithm L° stands for Algorithm L with the simplest Algorithm X, while 
Algorithm L + uses all of the lookahead heuristics that we are about to discuss. 
The first two problems involve rather large clauses，so they use the extended 
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Algorithm L of exercise 143. The third problem consists of 420 random ternary 
clauses on 100 variables. (Algorithm B ， incidentally, needs 80.1 teramems，and 
a search tree of 4.50 ter anodes, to show that those clauses are unsatisfiable.) 

The moral of this story is that it’s wise to do 100 times as much computation 
at every node of a large search tree, if we can thereby decrease the size of the 
tree by a factor of 1000 . 

How then can we distinguish a variable that’s good for branching from a 
variable that isn’t? We shall consider a three-step approach: 

• Preselecting，to identify free variables that appear to be good candidates; 

• Nesting，to allow candidate literals to share implied computations; 

• Exploring，to examine the immediate consequences of hypothetical decisions. 
While carrying out these steps，Algorithm X might discover a contradiction (in 
which case Algorithm L will take charge again at step L15); or the lookahead 
process might discover that several of the free literals are forced to be true 
(in which case it places them in the first U positions of the FORCE array). The 
explorations might even discover a way to satisfy all of the clauses (in which case 
Algorithm L will terminate and everybody will be happy). Thus，Algorithm X 
might do much more than simply choose a good variable on which to branch. 

The following recommendations for Algorithm X are based on Marijn Heule’s 
lookahead solver called march，one of the world’s best, as it existed in 2013. 

The first stage, preselection, is conceptually simplest，although it also in¬ 
volves some “handwaving” because it depends on necessarily shaky assumptions. 
Suppose there are N free variables. Experience has shown that we tend to get 
a good heuristic score h(l) for each literal representing the relative amount by 
which asserting l will reduce the current problem, if these scores approximately 
satisfy the simultaneous nonlinear equations 

h(l) = 0.1 -h a h(u) -f- h(u)h(v). ( 64 ) 

u not fixed 

A 

Here a is a magic constant, typically 3.5; and h{l) is a multiple of h{l) chosen 

/\ 

so that h h{l) = 2N is the total number of free literals. (In other words，the h 
scores on the right are “normalized” so that their average is 1 .) 

Any given set of scores h(l) can be used to derive a refined set h’ （ l) by letting 


h 1 (l) = 0.1 + a 

^GBIMP(/) 
u not fixed 


h(u) 



+ E 

(t^)GTIMP(Z) 


h{u) h(v) 


h 


ave 



h^e = ^^2h(l). ( 65 ) 

l 


Near the root of the search tree, when d < 1^ we start with h{l) = 1 for all l 
and then refine it five times (say). At deeper levels we start with the h(l) values 
from the parent node and refine them once. Exercise 145 contains an example. 

We’ve computed h(l) for all of the free literals but we won’t have time to 
explore them all. The next step is to select free variables CAND [j] for 0 < j < 
where C isn’t too large; we will insist that the number of candidates does not 
exceed 


random ternary clauses 
Heule 

march 

heuristic score 


September 23, 2015 


C^max 


max((7o, Ci/d) 


( 66 ) 



7 . 2 . 2.2 SATISFIABILITY: BACKTRACKING ALGORITHMS 41 

using cutoff parameters that are typically Cq = 30, C\ — 600. (See exercise 148.) 

We start by dividing the free variables into “participants” and “newbies”: 
A participant is a variable such that either x or x has played the role ofu or v in 
step L 8 , at some node above us in the search tree; a newbie is a nonparticipant. 
When d = 0 every variable is a newbie, because we’re at the root of the tree. 
But usually there is at least one participant, and we want to branch only on 
participants whenever possible, in order to maintain focus while backtracking. 

If we’ve got too many potential candidates, even after restricting consider¬ 
ation to participants, we can winnow the list down by preferring the variables x 
that have the largest combined score h(x)h(x). Step X3 below describes a fairly 
fast way to come up with the desired selection of C < (7 max candidates. 

A simple lookahead algorithm can now proceed to compute a more accurate 
heuristic score 丑 (/)，for each of the 2C literals l = CAND[j] or l = -iCAND [j] 
that we’ve selected for further scrutiny. The idea is to simulate what would 
happen if l were used for branching, by mimicking steps L4-L9 (at least to a 
first approximation) : Unit literals are propagated as in the exact algorithm, but 
whenever we get to the part of step L9 that changes the BIMP tables, we don’t 
actually make such a change; we simply note that a branch on l would imply 
u\J V 、 and we consider the value of that potential new clause to be h{u)h{v). 
The heuristic score H{1) is then defined to be the sum of all such clause weights: 

asserting l in L4 leads to asserting u\/ v in L9}. ( 67 ) 

For example, the problem waerden(3^ 3; 9) of ( 9 ) has nine candidate variables 
{1 ， 2,… ， 9} at the root of the search tree, and exercise 145 finds their rough 
heuristic scores h(l). The more discriminating scores H{1) turn out to be 

H(l) = h(2)h(3) + h(3)h(5) + h(4)h(7) + h(5)h(9) = 168.6; 

H(2) = h(l)h(3) + h(3)h(4) + h(4)h(6) + h(5)h(8) = 157.3; 

H(3) = h(l)h(2) + h(2)h(4) + h(4)h(5) + ... + h(6)h(9) = 233.4; 

= h(2)h(3) -f- h(3)h(5) -h h(5)h(6) + ... + h(l)h(7) = 231.8; 

H(5) = h(3)h(4) + h(3)h(6) + h(6)h(7) + ... + /i(l)/i(9) - 284.0. 

This problem is symmetrical, so we also have 丑 ( 6 )= 丑 ( 6 ) = H(4) = H(4 )， etc. 
The best literal for branching，according to this estimate, is 5 or 5. 

Suppose we set false and proceed to look ahead at the reduced problem ， 
with d = 1. At this point there are eight candidates ， {1 ， 2, 3,4, 6 , 7, 8 , 9}; and 
they’re now related also by binary implications，because the original clause c 357’ 
has，for instance, been reduced to ‘37’. In fact，the BIMP tables now define the 
dependency digraph 

61 M 19) (8) 

— _ I (68) 

because 3 —7 ， etc.; and in general the 2C candidate literals will define a 
dependency digraph whose structure yields important clues about the current 
subproblem. We can，for example，use Tarjan’s algorithm to find the strong 
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components of that digraph, as mentioned after Theorem 7.1.IK. If some strong 
component includes both l and T, the current subproblem is unsatisfiable. Other¬ 
wise two literals of the same component are constrained to have the same value; 
so we shall choose one literal from each of the S < 2C strong components, and 
use those choices as the actual candidates for lookahead. 

Continuing our example, at this point we can use a nice trick to save 
redundant computation, by extracting a subforest of the dependency digraph: 



The relation 8 — 2 means that whatever happens after asserting the literal 
will also happen after asserting c 8 ’； hence we need not repeat the steps for 
while studying c 8\ And similarly，each of the other subordinate literals c l\ 
• • • ， ‘9’ inherits the assertions of its parent in this hierarchy. Tarjan’s algorithm 
actually produces such a subforest with comparatively little extra work. 

The nested structure of a forest also fits beautifully with “degrees of truth” 
in our data structure，if we visit the S candidate literals in preorder of the 
subforest ， and if we successively assert each literal l at the truth degree that 
corresponds to twice its position in postorder. For instance, ( 6 g) becomes the 
following arrangement, which we shall call the “lookahead forest ”： 


preorder 1283464736982917 
2-postorder 2 6 4 8 10 14 12 22 16 18 20 26 24 32 28 30 



A simulation of steps L4-L9 with l ^ 1 and T 卜 2 makes x\ true at degree 2 (we 
say that it’s “2fixed” or “2true ”）； it also computes the score H(l) <— h(2)h(3) + 
h(4)h(7)^ but it spawns no other activity if Algorithm Y below isn’t active. 
Simulation with l •<— 2 and T •<— 6 then 6 fixes 2 and computes H(2) •<— 
h(4)h(6)] during this process the value of isn’t seen，because it is less than T. 
But things get more interesting when l ^ 8 and T 4: Now we 4fix 8 , and we’re 
still able to see that X 2 is true because 6 > T. So we save a little computation 
by inheriting H(2) and setting H(8) H(2) h ⑷ k ⑹ + h(6)h(7) + h(7)h(9). 

The real action begins to break through a few steps later, when we set / ^ 4 
and T •<— 12. Then ( 62 ) will 12fix not only 4 but also 3, since 4 — ^ 3; and the 
12truth of 3 will soon take us to the simulated step L 8 with u = 6 and v = 9. 
Aha: We 12fix 9, because 6 is 14true. Then we also 12fix the literals 7, 1， •…， and 
reach a contradiction. This contradiction shows that branching on 4 will lead to 
a conflict] hence the literal 4 must be true, if the current clauses are satisfiable. 

Whenever the lookahead simulation of Algorithm X learns that some literal l 
must be true, as in this example，it places l on the FORCE list and makes l proto 
true (that is, true in context PT). A proto true literal will remain fixed true 
throughout this round of lookahead, because all relevant values of T will be 
less than PT. Later, Algorithm L will promote proto truth to near truth, and 
ultimately to real truth — unless a contradiction arises. (And in the case of 
waerden(3^3] 9)，such a contradiction does in fact arise; see exercise 150.) 
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Why does the combination of preorder and postorder work so magically 
in (70)? It’s because of a basic property of forests in general，which we noted 
for example in exercise 2.3.2-20: If u and v are nodes of a forest，u is a proper 
ancestor of v if and only if u precedes v in preorder and u follows v in postorder. 
Moreover, when we look ahead at candidate literals in this way, an important 
invariant relation is maintained on the R stack，namely that truth degrees never 
increase as we move from the bottom to the top: 


VAL[|^_i|] I 1 > VAL[|i ^|]， for 1 < j < E. (71) 

Real truths appear at the bottom, then near truths, then proto truth, etc. For 
example，the stack at one point in the problem above contains seven literals ， 

j -0123456 
^•= 5 6 4 3 9 7 1. 

VALCI^p = RT+1 14 13 12 13 12 12 

One consequence is that the current visibility of truth values matches the recur¬ 
sive structure by which false literals are purged from ternary clauses. 

The second phase of Algorithm X， after preselection of candidates, is called 
“nesting，” because it constructs a lookahead forest analogous to (70). More 
precisely, it constructs a sequence of literals LL [j] and corresponding truth 
offsets L0 [j], for 0 < j < S. It also sets up PARENT pointers to indicate the forest 
structure more directly; for example，with (69) we would have PARENT (8) = 2 
and PARENT(2) = A. 

The third phase, “exploration，” now does the real work. It uses the looka¬ 
head forest to evaluate heuristics H{1) for the candidate literals — and also (if 
it’s lucky) to discover literals whose values are forced. 

The heart of the exploration phase is a breadth-first search based on steps L5, 
L6, and L8. This routine propagates truth values of degree T and also computes 
w, the weight of new binary clauses that would be spawned by branching on l: 


Set lo i- I， i t w t 0, and G ^ E ^ F] perform (62); 
while G < E ， set L i- Rg, G i- G + 1^ and 

take account of (u^v) for all (u^v) in TIMP(L); 
generate new binary clauses (Iq V Wk) for 0 < k < i. 


Here “take account of (u, v)^ means “if either u or is fixed true (in context T), 
do nothing; if both u and v are fixed false，go to CONFLICT; if u is fixed false but 
v isn’t fixed ， set Wi i- i i- i + 1 , and perform (62) with l v] if v is fixed 
false but u isn’t fixed ， set Wi i- i ^ i + 1 , and perform (62) with / ^ if 
neither u nor v is fixed ， set w ^ w + h(u) h(v) 

Explanation: A ternary clause of the form IjM u\! v, where L is fixed true 
and u is fixed false as a consequence of Iq being fixed true, is called a “windfall •” 
Such clauses are good news, because they imply that the binary clause Jo W v 
must be satisfied in the current subproblem. Windfalls are recorded on a stack 
called and appended to the BIMP database at the end of (72). 
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The exploration phase also exploits an important fact called the autarky 
principle, which generalizes the notion of “pure literal” that we discussed above 
in connection with Algorithm A. An “autarky” for a SAT problem F is a set 
of strictly distinct literals A = {ai ， •… ^a t } with the property that every clause 
of F either contains at least one literal of A or contains none of the literals of 
A = {ai, … ， a t }. In other words, A satisfies every clause that A or A “touches •” 
An autarky is a self-sufficient system. Whenever A is an autarky, we can 
assume without loss of generality that all of its literals are actually true; for if F 
is satisfiable, the untouched clauses are satisfiable，and A tells us how to satisfy 
the touched ones. Step X9 of the following algorithm shows that we can detect 
certain autarkies easily while we’re looking ahead. 

Algorithm X (Lookahead for Algorithm L). This algorithm, which is invoked 
in step L2 of Algorithm L，uses the data structures of that algorithm together 
with additional arrays of its own to explore properties of the current subproblem. 
It discovers U > 0 literals whose values are forced, and puts them in the FORCE 
array. It terminates either by (i) satisfying all clauses; (ii) finding a contradiction; 
or (iii) computing heuristic scores H{1) that will allow step L3 to choose a good 
literal for branching. In case (iii) it might also discover new binary clauses. 

XI. [Satisfied?] If F = n, terminate happily (no variables are free). 

X2. [Compile rough heuristics.] Set N = n — F and use ( 65 ) to compute a 
rough score h(l) for each free literal 1. 

X3. [Preselect candidates.] Let C be the current number of free variables that 
are “participants，” and put them into the CAND array. If (7 = 0， set 
C ^ N and put all free variables into CAND; terminate happily, however ， 
if all clauses are satisfied (see exercise 152). Give each variable x in CAND 
the rating r(x) = h(x)h(x). Then while C > 2C max (see ( 66 ))，delete all 
elements of CAND whose rating exceeds the mean rating; but terminate this 
loop if no elements are actually deleted. Finally ，if C > C max , reduce C to 
(7 max by retaining only top-ranked candidates. (See exercise 153.) 

X4. [Nest the candidates.] Construct a lookahead forest，represented in LL [j] 
and L0 [j] for 0 < j < 5 and by PARENT pointers (see exercise 155). 

X5. [Prepare to explore.] Set U' •<— / •<— BASE j •<— 0 and CONFLICT •<— X13. 

X 6 . [Choose l for lookahead.] Set l •(— LL [j] and T BASE + L0 [j]. Set 
H(l) •<— H (PARENT(0), where H( 八 ) = 0. If l is not fixed in context T, go to 
X 8 . Otherwise, if l is fixed false but not proto false, do step X12 with l ^ l. 

X7. [Move to next.] If U > U\ set U' ^ U and j 1 j. Then set j 4 - j + 1. If 
j = set j ^ 0 and BASE 卜 BASE + 2S. Terminate normally if j = j’，or 
if j = 0 and BASE + 25 > PT. Otherwise return to X 6 . 

X 8 . [Compute sharper heuristic.] Perform ( 72 ). Then if > 0， set H(l 0 ) 4 - 
H(Iq) + w and go to X10. 

X9. [Exploit an autarky.] If H(l 0 ) = 0, do step X12 with l t Iq. Otherwise gen¬ 
erate the new binary clause Iq V -iPARENT(Z 0 ) • (Exercise 166 explains why.) 
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X10. [Optionally look deeper.] Perform Algorithm Y below. 

Xll. [Exploit necessary assignments.] Do step X12 for all literals l G BIMP(『 0 ) 
that are fixed true but not proto true. Then go to X7. (See exercise 167.) 

X12. [Force l] Set FORCE[17] <- l, U <- U + 1 ， T 1 i- T, and perform ( 72 ) with 
T ^ PT. Then set T T 1 . (This step is a subroutine，used by other steps.) 

X13. [Recover from conflict.] If T < PT, do step X12 with l ^ Iq and go to X7. 
Otherwise terminate with a contradiction. | 

Notice that, in steps X5-X7, this algorithm proceeds cyclically through the 
forest，continuing to look ahead until completing a pass in which no new forced 
literals are found. The BASE address of truth values continues to grow, if 
necessary，but it isn’t allowed to become too close to PT. 

^Looking even further ahead. If it’s a good idea to look one step ahead, 
maybe it’s a better idea to look two steps ahead. Of course that’s a somewhat 
scary proposition, because our data structures are already pretty stretched; and 
besides, double lookahead might take way too much time. Nevertheless, there’s 
a way to pull it off, and to make Algorithm L run even faster on many problems. 

Algorithm X looks at the immediate consequences of assuming that some 
literal lo is true. Algorithm Y，which is launched in step X10, goes further out 

A 

on that limb, and investigates what would happen if another literal ， Zo, were also 
true. The goal is to detect branches that die off early, allowing us to discover 
new implications of lo or even to conclude that lo must be false. 

For this purpose Algorithm Y stakes out an area of truth space between the 
current context T and a degree of truth called “double truth” or DT ， which is 
defined in step Y2. The size of this area is determined by a parameter which 
is typically less than 10. The same lookahead forest is used to give relative truth 
degrees below DT. Double truth is less trustworthy than proto truth, PT; but 
literals that are fixed at level DT are known to be conditionally true (“Dtrue”) 
or conditionally false (“Dfalse”) under the hypothesis that lo is true. 

Going back to our example of waerden(3, 3; 9)，the scenario described above 
was based on the assumption that double lookahead was not done. Actually, 
however，further activity by Algorithm Y will usually take place after H(l) has 
been set to h ⑶ h ⑶ + h(4)h(7). The value of DT will be set to 130, assuming 
that F = 8, because S = 8. Literal 1 will become Dtrue. Looking then at 2 will 
6 fix 2; and that will 6 fix 3 because of the clause 123. Then 3 will 6 fix 4 and 7, 
contradicting 147 and causing 2 to become Dfalse. Other literals also will soon 
become Dtrue or Dfalse, leading to a contradiction; and that contradiction will 
allow Algorithm Y to make literal 1 proto false before Algorithm X has even 
begun to look ahead at literal 2 . 

The main loop of double lookahead is analogous to ( 72 )，but it’s simpler ， 
because we’re further removed from reality: 

A 

Set Zo f Z and G i- E <- F] perform ( 62 ); 
while G < set L <- Rg, G G + 1^ and (73) 

take account of (u^v) for all (u^v) in TIMP(L). 
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Now “take account of (u^v)^ means “if either w or r is fixed true (in context T )， 
or if neither u nor v is fixed, do nothing; if both u and v are fixed false, go to 
CONFLICT; if u is fixed false but v isn’t fixed，perform ( 62 ) with l i- v; if v is 
fixed false but u isn’t fixed, perform ( 62 ) with l < - u. v 

Since double-looking is costly，we want to try it only when there’s a fairly 
good chance that it will be helpful，namely when H(l 0 ) is large. But how large is 
large enough? The proper threshold depends on the problem being solved: Some 
sets of clauses are handled more quickly by double-looking，while others are im¬ 
mune to such insights. Marijn Heule and Hans van Maaren [LNCS 4501 (2007 )， 
258-271] have developed an elegant feedback mechanism that automatically 
tunes itself to the characteristics of the problem at hand: Let r be a “trigger，” 
initially 0. Step Y1 allows double-look only if H(Iq) > r; otherwise r is decreased 
to /3 丁 ， where ^ is a damping factor (typically 0.999)，so that double - looking 
will become more attractive. On the other hand if double-look doesn’t find a 
contradiction that makes lo proto false，the trigger is raised to HQ 0 ) in step Y 6 . 

Algorithm Y (Double lookahead for Algorithm X). This algorithm, invoked in 
step X10, uses the same data structures (and a few more) to look ahead more 
deeply. Parameters ^ and Y are explained above. Initially DFAIL(0 = 0 for all 1. 

Yl. [Filter.] Terminate if DFAIL(W = ISTAMP, or if T + 2S(Y + 1) > PT. 
Otherwise, if H(Iq) < r, set r (3r and terminate. 

Y2. [Initialize.] Set BASE ^ T-2, LBASE t BASE + 25• F, DT LBASE + L0[j], 
i i- j 1 i- j ^ 0^ E and CONFLICT 4 - Y 8 . Perform ( 62 ) with l Iq 

and T DT. 

Y3. [Choose l for double look.] Set l •<— LL [j] and T •<— BASE + L0 [jf]. If l is not 
fixed in context T, go to Y5. Otherwise, if l is fixed false but not Dfalse，do 
step Y7 with l i- I. 

Y4. [Move to next.] Set j j H- 1. If j = 5, set j 4 — 0 and BASE -f- BASE + 2*5. 
Go to Y 6 if j 1 = j, or if j = 0 and BASE = LBASE. Otherwise return to Y3. 

Y5. [Look ahead.] Perform ( 73 )，and return to Y4 (if no conflict arises). 

Y6. [Finish.] Generate new binary clauses (lo V Wk) for 0 < A: < Then set 
BASE — LBASE, T 1 — DT，r — H (l 0 ) ， DFAIL« 0 ) — ISTAMP, CONFLICT <- 
X13, and terminate. 

Y7. [Make lo false.] Set j 1 j, T 7 T, and perform ( 73 ) with l 4 - to and 
T •<— DT. Then set T T 7 , Wi i- io^ i i 1. (This step is a subroutine.) 

Y 8 . [Recover from conflict.] If T < DT, do step Y7 with l -f- ->LL [j] and go 
to Y4. Otherwise set CONFLICT •<— X13 and exit to X13. | 

Some quantitative statistics will help to ground these algorithms in reality: 
When Algorithm L was let loose on rand(3, 2062,500, 314)，a problem with 500 
variables and 2062 random ternary clauses，it proved unsatisfiability after making 
684,433,234,661 memory accesses and constructing a search tree of 9,530,489 
nodes. Exercise 173 explains what would have happened if various parts of the 
algorithm had been disabled. None of the other SAT solvers we shall discuss are 
able to handle such random problems in a reasonable amount of time. 
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Random satisfiability. There seems to be no easy way to analyze the satisfia¬ 
bility problem under random conditions. In fact，the basic question “How many 
random clauses of 3SAT on n variables do we need to consider，on the average, 
before they can’t all be satisfied?” is a famous unsolved research problem. 

From a practical standpoint this question isn’t as relevant as the analogous 
questions were when we studied algorithms for sorting or searching，because real- 
world instances of 3SAT tend to have highly nonrandom clauses. Deviations from 
randomness in combinatorial algorithms often have a dramatic effect on running 
time，while methods of sorting and searching generally stay reasonably close to 
their expected behavior. Thus a focus on randomness can be misleading. On the 
other hand，random SAT clauses do serve as a nice，clean model，so they give us 
insights into what goes on in Boolean territory. Furthermore the mathematical 
issues are of great interest in their own right. And fortunately, much of the basic 
theory is in fact elementary and easy to understand. So let’s take a look at it. 

Exercise 180 shows that random satisfiability can be analyzed exactly, when 
there are at most five variables. We might as well start there，because the “tiny” 
5-variable case is still large enough to shed some light on the bigger picture. 
When there are n variables and k literals per clause, the number N of possible 


clauses that involve k different variables is clearly 2 k ⑵： There are ⑵ ways to 
choose the variables，and 2 k ways to either complement or not. So we have，for 
example, N = 2 3 ( 3 ) = 80 possible clauses in a 3SAT problem on 5 variables. 

Let q m be the probability that m of those clauses, distinct but otherwise 


selected at random，are satisfiable. Thus q m = Q m j (=) ， where Q m is the number 


of ways to choose m of the N clauses so that at least one Boolean vector x = 
x\ .. .x n satisfies them all. Figure 40 illustrates these probabilities when k = 3 
and n = 5. Suppose we’re being given distinct random clauses one by one. 
According to Fig. 40， the chances are better than 77% that we’ll still be able 
to satisfy them after 20 different clauses have been received，because ^20 ~ 
0.776. But by the time we’ve accumulated 30 of the 80 clauses, the chance of 
satisfiability has dropped to ^30 ~ 0.179; and after ten more we reach ^40 ~ 0.016. 



random satisfiability- 



probability of satisfiability 


Fig- 40- The probability q m that m distinct clauses of 3SAT 
on 5 variables are simultaneously satisfiable，for 0 < m < 80. 

The illustration makes it appear as if g m = 1 for m < 15 ， say, and as if 
g m = 0 for m > 55. But qs is actually less than 1， because of (6); exercise 179 
gives the exact value. And is greater than 0, because Q 70 = 32; indeed，every 
Boolean vector x satisfies exactly (2 k — 1)(=) = (1 — 2~ k )N of the N possible 
clauses，so it’s no surprise that 70 noncontradictory 3 - clauses on 5 variables 
can be found. Of course those clauses will hardly ever be the first 70 received，in 
a random situation. The actual value of q^o is 32/1646492110120 « 2 x 10 -11 . 
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Fig. 41. The total number T m of different Boolean vectors x = xi .. .X 5 that 
simultaneously satisfy m distinct clauses of 3SAT on 5 variables，for 0 < m < 80. 


Figure 41 portrays the same process from another standpoint: It shows in 
how many ways a random set of m clauses can be satisfied. This value, T m , is a 
random variable whose mean is indicated in black, surrounded by a gray region 
that shows the mean plus-or-minus the standard deviation. For example，T 0 is 
always 32, and T\ is always 28; but T 2 is either 24, 25, or 26, and it takes these 
values with the respective probabilities (2200,480,480)/3160. Thus the mean 
for m = 2 is « 24.5, and the standard deviation is « 0.743. 

When m = 20, we know from Fig. 40 that T 2 o is nonzero more than 77% 
of the time; yet Fig. 41 shows that T 2 o ~ 1.47 士 1.17. (Here the notation ju 土 a 
stands for the mean value with standard deviation a.) It turns out, in fact, 
that 20 random clauses are uniquely satisfiable, with T 20 = 1， more than 33% of 
the time; and the probability that T 20 > 4 is only 0.013. With 30 clauses, satis¬ 
fiability gets dicier and dicier: T 30 ~ 0.20 士 0.45; indeed，T 30 is less than 2, more 
than 98% of the time — although it can be as high as 11 if the clause-provider is 
being nice to us. By the time 40 clauses are reached，the odds that T 40 exceeds 1 
are less than 1 in 4700. Figure 42 shows the probability that T m = 1 as m varies. 

•5 



Fig. 42. Pr(T m = 1), the probability that m distinct clauses 
of 3SAT on 5 variables are uniquely satisfiable，for 0 < m < 80. 


Let P be the number of clauses that have been received when we’re first 
unable to satisfy them all. Thus we have P = m with probability p m , where 
p m = q m -i — q m is the probability that m — 1 random clauses are satisfiable but 
m are not. These probabilities are illustrated in Fig. 43. Is it surprising that 
Figs. 42 and 43 look roughly the same? (See exercise 183.) 

The expected “stopping time，” EP，is by definition equal to ^ m rnp m ; and 
it’s not difficult to see, for example by using the technique of summation by 
parts (exercise 1.2.7-10), that we can compute it by summing the probabilities 
in Fig. 40: 

EP = (74) 

m 
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Fig. 43. The stopping time probabilities, p m ，that m distinct clauses 
of 3SAT on 5 variables have just become unsatisfiable，for 0 < m < 80. 
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The variance of P, namely E(P — EP) 2 = (EP 2 ) — (E_P ) 2 ， also has a simple 
expression in terms of the g’s，because 

EP 2 (2m + l)g m . (75) 

m 

In Figs. 40 and 43 we have EP ^ 25.22, with variance ^ 35.73. 

So far we’ve been focusing our attention on 3SAT problems, but the same 
ideas apply also to kSAT for other clause sizes k. Figure 44 shows exact results 
for the probabilities when n = 5 and 1 < A: < 4. Larger values of k give clauses 
that are easier to satisfy, so they increase the stopping time. With five variables 
the typical stopping times for random 1SAT, 2SAT, 3SAT, and 4SAT turn out 

to be respectively 4.06 士 1.19, 11.60 士 3.04, 25.22 士 5.98, and 43.39 士 7.62. In 

general if Pk, n is the stopping time for A:SAT on n variables，we let 

Sk,n = ( 76 ) 


be its expected value. 



Fig. 44. Extension of Fig. 40 to clauses of other sizes. 

Our discussions so far have been limited in another way too: We’ve been 
assuming that m distinct clauses are being presented to a SAT solver for solution. 
In practice, however ， it’s much easier to generate clauses by allowing repetitions ， 
so that every clause is chosen without any dependence on the past history. In 
other words, there’s a more natural way to approach random satisfiability, by 
assuming that N m possible ordered sequences of clauses are equally likely after 
m steps，not that we have (=) equally likely sets of clauses. 

Let q m be the probability that m random clauses (7i • • !\C m are satisfiable ， 

where each Cj is randomly chosen from among the N = 2 k ⑵ possibilities in a 
々 SAT problem on n variables. Figure 45 illustrates these probabilities in the case 
众 = 3, n = 5; notice that we always have q m > q m . If N is large while m is small ， 
it’s clear that q m will be very close to q m , because repeated clauses are unlikely 
in such a case. Still，we must keep in mind that qN is always zero, while q m is 
never zero. Furthermore，the “birthday paradox” discussed in Section 6.4 warns 
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Fig- 45. Random 3SAT on 5 variables when the clauses are sampled 
with replacement. The probabilities q m are shown with a black line; 
the smaller probabilities q m of Fig. 40 are shown in gray. 

us that repetitions aren’t as rare as we might expect. The deviations of q m from 
q m are particularly noticeable in small cases such as the scenario of Fig. 45. 

In any event, there’s a direct way to compute q m from the probabilities qt 
and the value of N (see exercise 184): 



And there are surprisingly simple formulas analogous to ( 74 ) and ( 75 ) for the 
stopping time P, where p m = q m -\ — @ m ，as shown in exercise 186: 


N 


EP 


EP 


2 


E 

m— 0 
N-l 

E 

m=0 
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Qm ] 
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g m ( 1 + 2 


N 


N 


+ 




(79) 


These formulas prove that the expected behavior of P is very much like that 
of P，if q m is small whenever m/N isn’t small. In the case k = 3 and n = 5, the 
typical stopping times P = 30.58 士 9.56 are significantly larger than those of P; 
but we are mostly interested in cases where n is large and where q m is essentially 
indistinguishable from q m . In order to indicate plainly that the probability q m 
depends on k and n as well as on m, we shall denote it henceforth by Sk(m,n)' 


Sk(m^n) = Pr(m random clauses of A:SAT are satisfiable )， ( 80 ) 

where the m clauses are “sampled with replacement” (they needn’t be distinct). 
Suitable pseudorandom clauses rand(k^m^n^ seed) can easily be generated. 

Exact formulas appear to be out of reach when n > 5， but we can make 
empirical tests. For example，extensive experiments on random 3SAT problems 
by B. Selman, D. G. Mitchell, and H. J. Levesque [Artificial Intelligence 81 
(1996) ， 17-29] showed a dramatic drop in the chances of satisfiability when the 
number of clauses exceeds about 4.27n. This “phase transition” becomes much 
sharper as n grows (see Fig. 46). 

Similar behavior occurs for random A:SAT, and this phenomenon has spawned 
an enormous amount of research aimed at evaluating the so-called satisfiability 
thresholds 

o^k = lim Sk^/n. ( 81 ) 

n-^00 
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4.27n 



Fig. 46. Empirical probability data shows that random 3SAT problems rapidly 
become unsatisfiable when there are more than a^n clauses, if n is large enough. 


Indeed，we can obtain quite difficult A:SAT problems by generating approximately 
aun random ^-clauses, using empirically observed estimates of ak- If n is large ， 
the running time for random 3SAT with 4.3n clauses is typically orders of mag¬ 
nitude larger than it is when the number of clauses is 4n or 4.6n. (And still 
tougher problems arise in rare instances when we have, say, 3.9n clauses that 
happen to be unsatisfiable.) 

Strictly speaking ， however, nobody has been able to prove that the so-called 
constants au actually exist, for all k\ The empirical evidence is overwhelming; 
but rigorous proofs for ^ = 3 have so far only established the bounds 


liminf Ss n /n > 3.52; 

n—>-oo 5 — 


lim sup Ss n /n < 4.49 


moo 



[See A. C. Kaporis ， L. M. Kirousis，and E. G. Lalas ， Random Structures & 
Algorithms 28 (2006), 444-480; J. Diaz, L. Kirousis, D. Mitsche, and X. Perez- 
Gimenez, Theoretical Comp. Sci. 410 (2009), 2920-2934.] A “sharp threshold” 
result has been established by E. Friedgut [J. Amer. Math. Soc. 12 (1999) ， 1017- 
1045, 1053—1054]，who proved the existence for A: > 2 of functions o^(n) with 


lim S k ([(a k (n) - e)n\,n) = 1 ， 


n—>-oc 


lim S k ([(a k (n) +e)nj,n) = 0, ( 83 ) 

n —>00 


when e is any positive number. But those functions might not approach a limit. 
They might, for example，fluctuate periodically，like the “wobble function” that 
we encountered in Eq. 5.2. 2 -( 47 ). 

The current best guess for 奶 ， based on heuristics of the “survey propaga¬ 
tion^ technique to be discussed below，is that 0^3 = 4.26675 士 0.00015 [S. Mertens ， 

M. Mezard, and R. Zecchina, Random Structures & Algorithms 28 (2006), 340- 

373]. Similarly, it appears reasonable to believe that ^ 9.931, ^ 21.12, 

«6 ~ 43.37, «7 « 87.79. The a^s grow as Q(2 k ) (see exercise 195); and they 
are known to be constant when k is sufficiently large [see J. Ding ， A. Sly，and 

N. Sun ， STOC 47 (2015)，to appear]. 


Analysis of random 2SAT. Although nobody knows how to prove that random 
3SAT problems almost always become unsatisfiable when the number of clauses 
reaches « 4.27n，the corresponding question for 2SAT does have a nice answer: 
The satisfiability threshold equals 1. For example，when the author first tried 
1000 random 2SAT problems with a million variables, 999 of them turned out to 
be satisfiable when there were 960,000 clauses, while all were unsatisfiable when 
the number of clauses rose to 1 ， 040,000. Figure 47 shows how this transition 
becomes sharper as n increases. 
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Fig. 47. Empirical satisfaction 
probabilities for 2SAT with ap¬ 
proximately n random clauses. 
(When n = 100, the probability 
doesn’t become negligible until 
more than roughly 180 clauses 
have been generated.) 


n — 1000000 n — 10000 n — 100 



The fact that S 2 , n w n was discovered in 1991 by V. Chvatal and B. Reed 
[FOCS 33 (1992), 620-627], and the same result was obtained independently 
at about the same time by A. Goerdt and by W. Fernandez de la Vega [see J. 
Comp. Syst. Sci. 53 (1996), 469-486; Theor. Comp. Sci. 265 (2001), 131-146]. 

The study of this phenomenon is instructive, because it relies on properties 
of the digraph that characterizes all instances of 2SAT. Furthermore，the proof 
below provides an excellent illustration of the “first and second moment princi¬ 
ples,^ equations MPR—( 21 ) and MPR—( 22 ). Armed with those principles ， we’re 
ready to derive the 2SAT threshold: 

Theorem C. Let c be a fixed constant. Then 

( 84 ) 

Proof. Every 2SAT problem corresponds to an implication digraph on the literals, 
with arcs I ― and V — >1 for each clause l\/ V. We know from Theorem 7.1.IK 
that a set of 2SAT clauses is satisfiable if and only if no strong component of its 
implication digraph contains both x and x for some variable x. That digraph 
has 2m = 2|_cn」arcs and 2n vertices. If it were a random digraph, well-known 
theorems of Karp (which we shall study in Section 7.4.1) would imply that only 
O(logn) vertices are reachable from any given vertex when c < 1 ， but that there 
is a unique “giant strong component” of size fl(n) when c > 1. 

The digraph that arises from random 2SAT isn’t truly random, because its 
arcs come in pairs, u — >v and v — >u. But intuitively we can expect that similar 
behavior will apply to digraphs that are just halfway random. For example, when 
the author generated a random 2SAT problem with n = 1000000 and m = .99n ， 
the resulting digraph had only two complementary pairs of strong components 
with more than one vertex, and their sizes were only 2, 2 and 7, 7; so the clauses 
were easily satisfiable. Adding another .01n clauses didn’t increase the number of 
nontrivial strong components，and the problem remained satisfiable. But another 
experiment with m — n — 1000000 yielded a strong component of size 420, 
containing 210 variables and their complements; that problem was unsatisfiable. 

Based on a similar intuition into the underlying structure, Chvatal and 
Reed introduced the following “snares and snakes” approach to the proof of 
Theorem C: Let’s say that an s-chain is any sequence of s strictly distinct literals; 
thus there are 2 s n- possible s-chains. Every 5 -chain C corresponds to clauses 

(『1 V ,2) ，（ J2VZ3) ， … ， (l s -l v / s ), (85) 


lim S 2 (|_cn 」 ， n) 


n —>00 


1 ， if c < 1 ; 
0 , if c > 1 . 
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which in turn correspond to two paths 5-snare 

first moment principle 

h — >l 2 —— 一 >l s and l s — . — >ls — >l 2 — >h ( 86 ) ^~ snake 

in the digraph. An s-snare (C •，仁 u) consists of an s-chain (7 and two indices t 
and where 1 < t < s and 1 < |u| < s] it specifies the clauses ( 85 ) together with 

(It V h) and (J s \/ l u ) if ^ > 0, (l s V l- u ) if w < 0, ( 87 ) 

representing l t ― and either l s — \ l\ u \ or l s — \ l\ u \. The number of possible 
s-snares is 2 S+I (s — l) 2 n-. Their clauses are rarely all present when m is small. 

Exercise 200 explains how to use these definitions to prove Theorem C in 
the case c < 1. First we show that every unsatisfiable 2SAT formula contains all 
the clauses of at least one snare. Then，if we define the binary random variable 


X(C]t ， u) = [all clauses of (C]t ， u) are present] ， 


( 88 ) 


it isn’t difficult to prove that the snares of every s-chain C are unlikely: 

EX(C;t,u) < m s+ 1 /(2n(n-l)) 朴 1 . ( 8 g) 

Finally, letting X be the sum of X(C]t ， u) over all snares，we obtain 


EX = < ^2" + 1 8(5-lK 

5>0 



2 

n 



3 


by Eq. 1.2.9—( 20 ). This formula actually establishes a stronger form of ( 84 )， 
because it shows that EX is only 0(n -1 / 4 ) when m — n — n 3 / 4 > cn. Thus 

5 2 (Ln -n 3/4 J,n) > Pr(X = 0) = 1 - Pr(X > 0) > 1 - 0(n- 1/4 ) (go) 
by the first moment principle. 

The other half of Theorem C can be proved by using the concept of a t- 
snake, which is the special case (C]t, —t) of a (2t — l)-snare. In other words， 
given any chain (“，•••，&，••• ，/ 2 t-i)，with s = 2t — 1 and It 
in the middle，a t-snake generates the clauses ( 85 ) together 
with (l t V l\) and (l s V l t ). When t = 5, for example，and 
(h, •••, ht-i) = ($ 1 ， …，吻）， the 2t = 10 clauses are 

51, 12, 23, 34, 45, 56, 67, 78, 89, 95, 

and they correspond to 20 arcs that loop around to form a 
strong component as shown here. We will prove that, when c > 1 
in ( 84 )，the digraph almost always contains such impediments to satisfiability. 

Given a (2t — 1)-chain C，where the parameter t will be chosen later，let 



Xc = [each clause of (C't, —t) occurs exactly once 
The expected value EXc is clearly f (2t), where 


(91) 
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f(r) = m- (2n(n — 1) — r) m r / (2n(n — l)) m 


(92) 
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is the probability that r specific clauses occur once each. Notice that 


fir) 


m 


2n(n — 1 ) 



1 + 0 (-) 

mJ 




thus the relative error will be 0(t 2 /n) if m = @(n) as n ^ oo. 

Now let X = summed over all R = 2 2 t_ 1 n^^ possible t-snakes C\ 

thus EX = Rf(2t). We want to show that Pr(X > 0) is very nearly 1， using 
the second moment principle; so we want to show that the expectation EX 2 = 
E Xc) Xd) = EXcXd is small. The key observation is that 

EXcXd = f(4t — r) if C and D have exactly r clauses in common. ( 94 ) 


Let p r be the probability that a randomly chosen t-snake has exactly r clauses 
in common with the fixed snake Then 


EX 2 _ R 2 Y ： r = 0 Prf^t-r) 

(EX ) 2 _ R 2 f(2t) 2 


2t 

r—0 


f(4t - r) 

/ ㈤ 2 


2t 

= J^Pr 


r—0 




1 + 0 




By studying the interaction of snakes (see exercise 201) one can prove that 

(2n) r p r = 0(t 4 /n) + 0(t)[r >t]-\- 0(n)[r = 2 t], for 1 < r < 2t. (g 6 ) 

Finally then, as explained in exercise 202, we can choose t = 卜 1 , 5 」 and m = 
Ln + n 5 / 6 」 ， to deduce a sharper form of ( 84 ) when c > 1 : 

5 2 (Ln + n 5 / 6 J,n) = 0(n _1/3 °). ( 97 ) 

(Deep breath.) Theorem C is proved. | 

Much more precise results have been derived by B. Bollobas, C. Borgs, J. T. 
Chayes, J. H. Kim, and D. B. Wilson，in Random Structures & Algorithms 18 
(2001), 201-256. For example, they showed that 

S 2 (ln — n 3j/4 \ , n) = exp(— 0 (n -1 / 4 )); S 2 (ln-\-n 3j/4 \ , n) = exp(—©(n 1 / 4 )). ( 98 ) 

Resolution. The backtracking process of Algorithms A, B, D, and L is closely 
connected to a logical proof procedure called resolution. Starting with a family of 
clauses called “axioms，” there’s a simple rule by which new clauses can be derived 
from this given set: Whenever both x\/ A 1 and x V A n are in our repertoire of 
clauses ， we’re allowed to derive the “resolvent” clause A = A f W A ,f ^ denoted by 
(x \/ A') o (x \/ A"). (See exercises 218 and 219.) 

A proof by resolution consists of a directed acyclic graph (dag) whose vertices 
are labeled with clauses in the following way: (i) Every source vertex is labeled 
with an axiom, (ii) Every other vertex has in-degree 2. (iii) If the predecessors 
of vertex v are v 1 and v 11 ^ the label of v is C(v) = C(v r ) o C{v n ). 

When such a dag has a sink vertex labeled we call it a “resolution proof 
of 乂”； and if A is the empty clause, the dag is also called a “resolution refutation •” 
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The dag of a proof by resolution can be expanded to a binary tree, by 
replicating any vertex that has out-degree greater than 1. Such a tree is said 
to be regular if no path from the root to a leaf uses the same variable twice to 
form a resolvent. For example, Fig. 48 is a regular resolution tree that refutes 
Rivest’s unsatisfiable axioms ( 6 ). All arcs in this tree are directed upwards. 





234 


341 


234 


234 


341 


234 


341 


regular resolution 
treelike resolution 
Rivest 

unnecessary branch 
lookahead 
Impagliazzo 
Pudlak 

Prover—Delayer game 


Fig- 48 - One way to derive e by resolving the inconsistent clauses (6). 

Notice that Fig. 48 is essentially identical to Fig. 39 on page 33, the backtrack 
tree by which Algorithm D discovers that the clauses of ( 6 ) are unsatisfiable. 
In fact this similarity is no coincidence: Every backtrack tree that records 
the behavior of Algorithm D on a set of unsatisfiable clauses corresponds to a 
regular resolution tree that refutes those axioms, unless Algorithm D makes an 
unnecessary branch. (An unnecessary branch occurs if the algorithm tries x <- 0 
and x i- 1 without using their consequences to discover an unsatisfiable subset 
of axioms.) Conversely，every regular refutation tree corresponds to a sequence 
of choices by which a backtrack-based SAT solver could prove unsatisfiability. 

The reason behind this correspondence isn’t hard to see. Suppose both 
values of x need to be tried in order to prove unsatisfiability. When we set 
x ^ 0 in one branch of the backtrack tree, we replace the original clauses F 
by F I 龙 ， as in ( 54 ). The key point is that we can prove the empty clause by 
resolution from F\x if and only if we can prove x by resolution from F without 
resolving on x. (See exercise 224.) Similarly，setting x i- 1 corresponds to 
changing the clauses from F to F\x. 

Consequently, if F is an inconsistent set of clauses that has no short refuta¬ 
tion tree, Algorithm D cannot conclude that those clauses are unsatisfiable unless 
it runs for a long time. Neither can Algorithm L, in spite of enhanced lookahead. 

R. Impagliazzo and P. Pudlak [SODA 11 (2000), 128-136] have introduced 

an appealing Prover—Delayer game, with which it’s relatively easy to demon¬ 
strate that certain sets of unsatisfiable clauses require large refutation trees. 
The Prover names a variable and the Delayer responds by saying either x ^ 0 
or x 4- 1 or x 4- In the latter case the Prover gets to decide the value of x\ but 
the Delayer scores one point. The game ends when the current assignments have 
falsified at least one clause. If the Delayer has a strategy that guarantees a score 
of at least m points, exercise 226 shows that every refutation tree has at least 2 m 
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leaves; hence at least 2 m —1 resolutions must be done，and every backtrack-based 
solver needs H( 2 m ) operations to declare the clauses unsatisfiable. 

We can apply their game，for example，to the following interesting clauses: 


anti-maximal-element clauses 

binary relation 

irreflexive 

transitive 

partial ordering 

maximal 

Lauria 

Stalmarck 

hyperresolution 


There are mr variables Xjk, for 1 < j，k < m ， which we can regard as the inci¬ 
dence matrix for a binary relation c j 乂 k’• With this formulation, ( 99 ) says that 
the relation is irreflexive，and ( 100 ) says that it’s transitive; thus ， ( 99 ) and ( 100 ) 
amount to saying that j k is a partial ordering. Finally ， ( 101 ) says that, for ev¬ 
ery j , there’s a A: with j -< k. So these clauses state that there’s a partial ordering 
on {1 ， … ， m} in which no element is maximal; and they can’t all be satisfied. 

We can ， however, always score m — 1 points if we’re playing Delayer in that 
game，by using the following strategy suggested by Massimo Lauria: At every 
step we know an ordered set S of elements，regarded as “small ”； initially 5 = 0 ， 
and we’ll have S = {ji ^... ， when our score is s. Suppose the Prover queries 
Xjk, and s < m — 2. If j = we naturally reply that Xjk ^ 0. Otherwise ，if j ^ S 
and k ^ S^we respond Xjk *； then s l s + 1 ， and j $ i- j or k according as the 
Prover specifies Xjk 卜 1 or Xjk ^ 0. Otherwise，if j G 5 and k 丰 S , 观 respond 
Xjk 4- 1; if j ^ 5 and k ^ we respond Xjk 0. Finally, if j = j a ^ S and k = 
jb G we respond Xjk ^ [a < b]. These responses always satisfy ( 99 ) and ( 100 ). 
And no clause of ( 101 ) becomes false before the Delayer is asked a question with 
s = m — 2. Then the response Xjk * gains another point. We’ve proved 

Theorem R. Every refutation tree for the clauses ( 99 ) ， ( 100 ) ， ( 101 ) represents 
at least 2 m_1 — 1 resolution steps. | 

On the other hand，those clauses do have a refutation dag of size 0(m 3 ). 
Let Ij and stand for the irreflexivity and transitivity axioms ( 99 ) and ( 100 ); 
and let — Xji V • • • V Xjk，so that ( 101 ) is Mj m . Then we have 

-^-im ^ T^imk ~ ^i(m—1) V X m k ， for 1 i ， k 〈 Tfl. (102) 

Calling this new clause M[ mk , we can now derive 

^j(m-l) — (( ... mm ◊ Mj ml ) O Mj m2 ) ◊...)◊ 0-^m 5 

for 1 < j < m. Hence (m — l ) 2 + (m — l)m resolutions have essentially reduced 
m to m — 1. Eventually we can therefore derive Mu ； then M n o Ii = e. [This 
elegant refutation is due to G. Stalmarck, Acta Informatics, 33 (1996) ， 277-280.] 

The method we’ve just used to obtain from M mm is, incidentally, 

a special case of a useful general formula called hyperresolution that is easily 
proved by induction on r: 

(... ((Co V A V ... V $ r ) ◊ (G V 办 ）)◊...)◊ (C r V 心） 

=Co V Cl V • • • V C r . ( 103 ) 


、/ \)/ 
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*Lower bounds for general resolution. Let’s change our perspective slightly: 
Instead of visualizing a proof by resolution as a directed graph, we can think of it 
as a “straight line” resolution chain, analogous to the addition chains of Section 
4.6.3 and the Boolean chains of Section 7.1.2. A resolution chain based on m 
axioms C\^ … ， C m appends additional clauses C^+i ， … ， （ 7 m + r ，each of which 
is obtained by resolving two previous clauses of the chain. Formally, we have 

Ci = Cj(j) ◊ Q ⑷， for m+l<i<m + r, ( 104 ) 

where 1 < j(i) < i and 1 < k(i) < i. It’s a refutation chain for (7i，• • • ， C m if 
(7 m+r = e. The tree in Fig. 48, for example，yields the refutation chain 

123,234,341,412,123,234,341,412,123,124,123,124,12,12,12,12,1,1,6 

for the axioms ( 6 ); and there are many other ways to refute those axioms，such as 

123,234,341,412,123,234,341,412,123,13,14,34,24,24,2,13,34,14,3,1,1,e. ( 105 ) 

This chain is quite different from Fig. 48, and perhaps nicer: It has three more 
steps，but after forming c 123’ it constructs only very short clauses. 

We’ll see in a moment that short clauses are crucial if we want short chains. 
That fact turns out to be important when we try to prove that certain easily 
understood families of axioms are inherently more difficult than ( 99 ) ，（ 100 )， 
and ( 101 )，in the sense that they can’t be refuted with a chain of polynomial size. 

Consider，for example, the well known “pigeonhole principle，” which states 
that m+1 pigeons don’t fit in m pigeon-sized holes. If Xjk means that pigeon j oc¬ 
cupies hole for 0 < j < m and 1 < k < the relevant unsatisfiable clauses are 

(xji V Xj2 V … V Xjra)^ for 0 < j < m; (106) 

(xik V Xjk)^ for 0 < i < j < m and 1 < k < m. (107) 

(“Every pigeon has a hole，but no hole hosts more than one pigeon •”） These 
clauses increased the pigeonhole principle’s fame during the 1980s，when Armin 
Haken [Theoretical Computer Science 39 (1985), 297-308] proved that they have 

no short refutation chain. His result marked the first time that any set of clauses 
had been shown to be intractable for resolution in general. 

It is absolutely necessary that two people have equally many hairs. 
— JEAN APPIER HANZELET, Recreation Mathematicque (1624) 

Haken’s original proof was rather complicated. But simpler approaches were 
eventually found, culminating in a method by E. Ben-Sasson and A. Wigderson 

[JACM 48 (2001), 149-169], which is based on clause length and applies to 
many other sets of axioms. If a is any sequence of clauses, let us say that its 
width, written w(a)^ is the length of its longest clause or clauses. Furthermore, 
if ao = (Ci ， •… ， （ 7 m )，we write ^(ao h e) for the minimum of w(a) over all 
refutation chains a = ((7i ， … ， C m + r ) for ao，and ||ao 卜 e|| for the minimum 
length r of all such chains. The following lemma is the key to proving lower 
bounds with Ben-Sasson and Wigderson’s strategy: 


resolution chain 
refutation chain 
pigeonhole principle 
Haken 

APPIER HANZELET 
Ben-Sasson 


Wigderson 

width 


notation 

notation 

notation 


w(a) 
w(a h e) 
\\ahC\\ 


September 23, 2015 




58 COMBINATORIAL ALGORITHMS (F 6 A: 23 Sep 2015 @1959) 7.2.2.2 

Lemma B. ||ao e|| > e (^(^ot-€)-i) 2 /( 8 n) — 2 , for clauses in n > w(ao) 2 variables. 
Thus there’s exponential growth if we have w(ao) = 0(1) and w(ao h e) = 

Proof. Let a = (Ci ， … ， (7 m + r ) be a refutation of ao with r = ||ao h e||. We 
will say that a clause is “fat” if its length is W or more, where W > w(ao) is a 
parameter to be set later. If a \ ao contains / fat clauses，those clauses contain 
at least Wf literals; hence some literal l appears in at least Wf /(2n) of them. 

Now a I /， the chain obtained by replacing each clause Cj by Cj | is a 
refutation of ao | Z that contains at most Lp/」fat clauses，where p = 1 — W/(2n). 
(The clause Cj \ l will be p if / G Cj, thus tautological and effectively absent.) 

Suppose / < p~ b for some integer b. We will prove, by induction on b and 
secondarily on the total length of all clauses，that there’s a refutation of ao 
such that w(/3) <W + b. This assertion holds when 6 = 0, since W > w(ao). If 
6 > 0 , there’s a refutation Pq of ao | Z with w(Po) < VF+ 6 — 1 ， when we choose l as 
above, because pf < p 1 — & and a 1 1 refutes ao I i. Then we can form a resolution 
chain that derives l from a。，by inserting l appropriately into clauses of 
And there’s a simple chain 灸 that derives the clauses of ao | T from a。and l. 
There’s also a refutation of ao | ^ with w(/3s) < VF + 6 , by induction，because 
a 1 1 refutes ao IThus the combination [5 = {Hlh} refutes a。，with 

w(P) = max(w(/?o) + l,io(/?2),^(/?3)) < max(W + 6, w(ao ), W-\-b) = W+b. 

Finally, exercise 238 chooses W so that we get the claimed bound. | 

The pigeon axioms are too wide to be inserted directly into Lemma B. But 
Ben-Sasson and Wigderson observed that a simplified version of those axioms ， 
involving only clauses of 5SAT，is already intractable. 

Notice that we can regard the variable Xjk as indicating the presence of an 
edge between aj and bk in a bipartite graph on the vertices A = {a。，• • •, a m } 
and B = {bi^...，b m }. Condition ( 106 ) says that each aj has degree > 1, while 
condition ( 107 ) says that each bk has degree < 1. There is ， however, a bipartite 
graph Go on those vertices for which each aj has degree < 5 and such that the 
following strong “expansion” condition is satisfied: 

Every subset A' C A with \A'\ < m/3000 has \dA'\ > |^| in Go- ( 108 ) 

Here dA! denotes the bipartite boundary of A! ^ namely the set of all bk that have 
exactly one neighbor in A!. 

Given such a graph whose existence is proved (nonconstructively) in 
exercise 240, we can formulate a restricted pigeonhole principle, by which the pi¬ 
geonhole clauses are unsatisfiable if we also require Xjk whenever aj -f-bk in Gq. 

Let a(Go) denote the resulting clauses，which are obtained when axioms 
( 106 ) and ( 107 ) are conditioned on all such literals Xjk- Then w(a(Go)) < 5, 
and at most 5m + 5 unspecified variables Xjk remain. Lemma B tells us that 
all refutation chains for a(Go) have length expH(m) if we can prove that they 
all have width Haken’s theorem, which asserts that all refutation chains 

for ( 106 ) and ( 107 ) also have length expH(m)，will follow，because any short 
refutation would yield a short refutation of a(Go) after conditioning on the x^. 
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Thus the following result gives our story a happy ending: 

Theorem B. The restricted pigeonhole axioms a (Go) have refutation width 

w(a(Go) e) > m/6000. ( 109 ) 

Proof. We can assign a complexity measure to every clause C by defining 

/x((7) = min{ |^4’| \ A' C A and a{A') h (7}. ( 110 ) 

Here a(A f ) is the set of “pigeon axioms” ( 106 ) for aj G A!^ together with all 
of the “hole axioms” ( 107 ); and a{A ! ) h C means that clause C can be proved 
by resolution when starting with only those axioms. If C is one of the pigeon 
axioms，this definition makes fi(C) = 1, because we can let A 1 = {aj}. And if C 
is a hole axiom，clearly fi(C) = 0. The subadditive law 

^(c'oc") < (in) 

also holds，because a proof of C r o C" needs at most the axioms of a(A ! ) U a(A n ) 
if C' follows from a(A') and C" follows from a(A"). 

We can assume that m > 6000. And we must have fi(e) > m/3000, because 
of the strong expansion condition ( 108 ). (See exercise 241.) Therefore every refu¬ 
tation of a((^o) must contain a clause C with m/6000 < fi(C) < m/3000; indeed ， 
the first clause Cj with > m/6000 will satisfy this condition，by ( 111 ). 

Let A 1 be a set of vertices with |A’| = fi(C) and a(A ! ) h C. Also let bk be 
any element of dA!^ with aj its unique neighbor in A!. Since \A! \ a^| < 
there must be an assignment of variables that satisfies all axioms of a(A f \ aj), 
but falsifies C and the pigeon axiom for jL That assignment puts no two pigeons 
into the same hole, and it places every pigeon of A! \ aj. 

Now suppose C contains no literal of the form or for any ay G A. 
Then we could set 0 for all jf’，without falsifying any axiom of a(A f \ aj)] 

and we could then make the axioms of a({aj}) true by setting Xjk <— 1. But 
that change to the assignment would leave C false，contradicting our assumption 
that a(A f ) h C. Thus C contains some 士 for each bk G dA 1 ] and we must 
have w(C) > \dA'\ > m/6000. ■ 

A similar proof establishes a linear lower bound on the refutation width ， 
hence an exponential lower bound on the refutation length，of almost all random 
3SAT instances with n variables and [cm 」 clauses，for fixed a as n ^ 00 (see exer¬ 
cise 243), a theorem of V. Chvatal and E. Szemeredi [JACM 35 (1988), 759-768]. 

Historical notes: Proofs by resolution, in the more general setting of first 
order logic, were introduced by J. A. Robinson in JACM 12 (1965), 23-41. 
[They’re also equivalent to G. Gentzen’s “cut rule for sequents，” Mathematische 
Zeitschrift 39 (1935), 176-210, III.1.2 1.] Inspired by Robinson’s paper, Greg¬ 
ory Tseytin developed the first nontrivial techniques to prove lower bounds on 
the length of resolution proofs, based on unsatisfiable graph axioms that are 
considered in exercise 245. His lectures of 1966 were published in Volume 8 
of the Steklov Mathematical Institute Seminars in Mathematics (1968); see 
A. O. Slisenko’s English translation, Studies in Constructive Mathematics and 
Mathematical Logic, part 2 (1970) ， 115-125. 
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Tseytin pointed out that there’s a simple way to get around the lower bounds 
he had proved for his graph - oriented problems, by allowing new kinds of proof 
steps: Given any set of axioms we can introduce a new variable 2 ： that doesn’t 
appear anywhere in F^ and add three new clauses G = {xz^ yz^ xyz}] here x 
and y are arbitrary literals of F. It’s clear that F is satisfiable if and only if 
F U G is satisfiable, because G essentially says that 2 ： = NAND(x,y). Adding 
new variables in this way is somewhat analogous to using lemmas when proving 
a theorem, or to introducing a memo cache in a computer program. 

His method, which is called extended resolution, can be much faster than 
pure resolution. For example, it allows the pigeonhole clauses ( 106 ) and ( 107 ) to 
be refuted in only 0(m 4 ) steps (see exercise 237). It doesn’t appear to help much 
with certain other classes of problems such as random 3SAT; but who knows? 


SAT solving via resolution. The concept of resolution also suggests alternative 
ways to solve satisfiability problems. In the first place we can use it to eliminate 
variables: If F is any set of clauses on n variables, and if x is one of those 
variables，we can construct a set F 1 of clauses on the other n — 1 variables in 
such a way that F is satisfiable if and only if F 1 is satisfiable. The idea is simply 
to resolve every clause of the form xW A 1 with every clause of the form x\/ A n ^ 
and then to discard those clauses. 

For example，consider the following six clauses in four variables: 

1234, 12, 123, 13, 23, 34. ( 112 ) 

We can eliminate the variable X 4 by forming 1234 o 34 = 123. Then we can 
eliminate X 3 by resolving 123 and 13 with 123 and 23: 

123 o 123 = p, 123 o 23 = 12, 13ol23 = 12, 13o23 = 12. 

Now we’re left with {12, 12,12, 12}，because the tautology p goes away. Elimi¬ 
nating X 2 gives {1 ， I}，and eliminating x\ gives {e}; hence ( 112 ) is unsatisfiable. 

This method, which was originally proposed for hand calculation by E. W. 
Samson and R. K. Mueller in 1955, works beautifully on small problems. But 
why is it valid? There are (at least) two good ways to understand the reason. 
First ， it’s easy to see that F 1 is satisfiable whenever F is satisfiable, because 
C f o C n is true whenever C f and C" are both true. Conversely, if F 1 is satisfied 
by some setting of the other n — 1 variables, that setting must either satisfy A 1 
for all clauses of the form x\/ A 1 ^ or else it must satisfy A n for all clauses of the 
form x\/ A n . (Otherwise neither A 1 nor A n would be satisfied, for some A! and 
some _A 〃， and the clause A 1 W A 11 in F 1 would be false.) Thus at least one of the 
settings : r 卜 0 or t 卜 1 will satisfy F. 

Another good way to understand variable elimination is to notice that it 
corresponds to the elimination of an existential quantifier (see exercise 248). 

Suppose p clauses of F contain x and q clauses contain x. Then the elimina¬ 
tion of x will give us at most pq new clauses, in the worst case; so F 1 will have no 
more clauses than F did，whenever pq < p + namely when (p — l)(g — 1) < 1. 
This condition clearly holds whenever p = 0 or g = 0; indeed, we called x a 
“pure literal” when such cases arose in Algorithm A. The condition also holds 
whenever p = 1 or g = 1, and even when p = q = 2. 
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Furthermore we don’t always get pq new clauses. Some of the resolvents 
might turn out to be tautologous, as above; others might be subsumed by existing 
clauses. (The clause C is said to subsume another clause if C C C\ in the 
sense that every literal of C appears also in C f • In such cases we can safely 
discard C 1 .) And some of the resolvents might also subsume existing clauses. 

Therefore repeated elimination of variables doesn’t always cause the set of 
clauses to explode. In the worst case ， however，it can be quite inefficient. 

In January of 1972, Stephen Cook showed his students at the University of 
Toronto a rather different way to employ resolution in SAT-solving. His elegant 
procedure, which he called “Method I，” essentially learns new clauses by doing 
resolution on demand: 

Algorithm I (Satisfiability by clause learning). Given m nonempty clauses 
(7i 八…八 C m on n Boolean variables x\ …: r n ，this algorithm either proves them 
unsatisfiable or finds strictly distinct literals h .. .l n that satisfy them all. In the 
process，new clauses may be generated by resolution (and m will then increase). 

11. [Initialize.] Set d 0. 

12. [Advance.] If d = n, terminate successfully (the literals {’ 1 ， •… Jd} satisfy 
{Ci ， … ， C m }). Otherwise set d ^ d+1，and let Id be a literal strictly distinct 
from … ， ld-i- 

13. [Find falsified CV] If none of C\^ … ， C m are falsified by {L ， … ， 4/}，go back 
to 12. Otherwise let Ci be a falsified clause. 

14. [Find falsified Cj] (At this point we have Id ^ Ci ^ { 『 1 ， …， &}，but no 
clause is contained in |7i， …， Set Id 4 — Id- If none of (7i ， … ， C m are 
falsified by {7i ， … ， 4/}，go back to 12. Otherwise let Id ^ Cj ^ {h^ • ^ 4/}- 

15. [Resolve.] Set m 4 - m + 1 ， C m ^ Ci o Cj. Terminate unsuccessfully if C m is 

empty. Otherwise set d ^ max{t | It G C m }^ i i- m, and return to 14. | 

In step 15 the new clause C m cannot be subsumed by any previous clause Ck for 
k < m, because Ci o Cj C {& ， ... ， Id-i}- Therefore，in particular, no clause is 
generated twice，and the algorithm must terminate. 

This description is intentionally vague when it uses the word “let” in steps 
12,13, and 14: Any available literal Id can be selected in step 12, and any falsified 
clauses Ci and Cj can be selected in steps 13 and 14, without making the method 
fail. Thus Algorithm I really represents a family of algorithms, depending on 
what heuristics are used to make those selections. 

For example，Cook proposed the following way (“Method IA”）to select Id 
in step 12: Choose a literal that occurs most frequently in the set of currently 
unsatisfied clauses that have the fewest unspecified literals. When applied to the 
six clauses ( 112 )， this rule would set 卜 3 and I 2 ^ and Z 3 卜 1; then step 13 
would find Ci = 123 false. So step 14 would set Is ^ 1 and find Cj = 12 false ， 
and step 15 would learn C 7 = 23. (See exercise 249 for the sequel.) 

Cook’s main interest when introducing Algorithm I was to minimize the 
number of resolution steps; he wasn’t particularly concerned with minimizing 
the running time. Subsequent experiments by R. A. Reckhow [Ph.D. thesis 
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(Univ. Toronto ， 1976) ， 81—84] showed that, indeed，relatively short resolution 
refutations are found with this approach. Furthermore, exercise 251 demon¬ 
strates that Algorithm I can handle the anti - maximal - element clauses ( 99 )-( 101 ) 
in polynomial time; thus it trounces the exponential behavior exhibited by all 
backtrack-based algorithms for this problem (see Theorem R). 

On the other hand, Algorithm I does tend to fill memory with a great many 
new clauses when it is applied to large problems, and there’s no obvious way to 
deal with those clauses efficiently. Therefore Cook’s method did not appear to be 
of practical importance, and it remained unpublished for more than forty years. 

Conflict driven clause learning. Algorithm I demonstrates the fact that 
unsuccessful choices of literals can lead us to discover valuable new clauses ， 
thereby increasing our knowledge about the characteristics of a problem. When 
that idea was rediscovered from another point of view in the 1990s, it proved to 
be revolutionary: Significant industrial instances of SAT with many thousands 
or even millions of variables suddenly became feasible for the first time. 

The name CD CL solver is often given to these new methods，because they are 
based on “conflict driven clause learning” rather than on classical backtracking. 
A CDCL solver shares many concepts with the DPLL algorithms that we’ve 
already seen; yet it is sufficiently different that we can understand it best by 
developing the ideas from scratch. Instead of implicitly exploring a search tree 
such as Fig. 39， a CDCL solver is built on the notion of a trail, which is a 
sequence LqLi ... Lp-i of strictly distinct literals that do not falsify any clause. 
We can start with F = 0 (the empty trail). As computation proceeds, our task 
is to extend the current trail until F = n, thus solving the problem, or to prove 
that no solution exists，by essentially learning that the empty clause is true. 

Suppose there a clause c of the form l\! a\M •… M a]^ where a\ through 
are in the trail but l isn’t. Literals in the trail are tentatively assumed to 
be true，and c must be satisfied; so we’re forced to make l true. In such 
cases we therefore append l to the current trail and say that c is its “reason.” 
(This operation is equivalent to what we called “unit propagation” in previous 
algorithms; those algorithms effectively removed the literals a\^ ... when they 
became false, thereby leaving l as a “unit” all by itself. But our new viewpoint 
keeps each clause c intact, and knows all of its literals.) A conflict occurs if the 
complementary literal l is already in the trail，because l can’t be both true and 
false; but let’s assume for now that no conflicts arise, so that l can legally be 
appended by setting Lp ^ I and F ^ F + 1. 

If no such forcing clause exists，and if F < n, we choose a new distinct literal 
in some heuristic way, and we append it to the current trail with a “reason” 
of A. Such literals are called decisions. They partition the trail into a sequence 
of decision levels, whose boundaries can be indicated by a sequence of indices 
with 0 = io < ^1 < ^2 < ^3 < • • • ； literal L t belongs to level d if and only if 
id < ^d+i. Level 0, at the beginning of the trail, is special: It contains literals 
that are forced by clauses of length 1， if such clauses exist. Any such literals are 
unconditionally true. Every other level begins with exactly one decision. 
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Consider，for example，the problem waerden(3^3]9) of (g). The first items 
placed on the trail might be 




2 



4 




level 


1 

2 

2 




3 




reason 


A 

(a decision) 

A 

(a decision) 

396 

(rearrangement of the clause 369) 

A 

(a decision) 

546 

(rearrangement of the clause 456) 

846 

(rearrangement of the clause 468) 

246 


753 

(rearrangement of the clause 357) 

258 

(a conflict!) 


( 叫 ) 


Three decisions were made, and they started levels at i\ = 0 ， <2 = 1 ， <3 = 3. 
Several clauses have been rearranged; we’ll soon see why. And propagations have 
led to a conflict，because both 2 and 2 have been forced. (We don’t actually 
consider the final entry L s to be part of the trail，because it contradicts L 6 .) 

If the reason for l includes the literal 『’， we say “Z depends directly on "•” 
And if there’s a chain of one or more direct dependencies, from Z to Zi to ••• to 
Ik = we say simply that u l depends on V•” For example，5 depends directly 


on 4 and 6 in ( 113 )，and 2 depends directly on 5 and 8 ; hence 2 depends on 6 . 

Notice that a literal can depend only on literals that precede it in the trail. 
Furthermore, every literal l that’s forced at level d > 0 depends directly on some 
other literal on that same level d] otherwise l would already have been forced at 
a previous level. Consequently l must necessarily depend on the dth decision. 


The reason for reasons is that we need to deal with conflicts. We will see that 
every conflict allows us to construct a new clause c that must be true whenever 
the existing clauses are satisfiable，although c itself does not contain any existing 
clause. Therefore we can “learn” c by adding it to the existing clauses, and we 
can try again. This learning process can’t go on forever，because only finitely 
many clauses are possible. Sooner or later we will therefore either find a solution 
or learn the empty clause. That will be nice，especially if it happens sooner. 

A conflict clause c on decision level d has the form f V V … V 知 ， where 
l and all the a’s belong to the trail; furthermore l and at least one a; belong to 
level d. We can assume that l is rightmost in the trail, of all the literals in c. 
Hence l cannot be the dth decision; and it has a reason, say Z V 的 V … V 私 , • 
Resolving c with this reason gives the clause d = ai V • • • V a/c V V • • • V a^,, 
which includes at least one literal belonging to level d. If more than one such 
literal is present，then d is itself a conflict clause; we can set c c 1 and repeat 
the process. Eventually we are bound to obtain a new clause d of the form 
P V V … V ， where l 1 is on level d and where b\ through b r are on lower levels. 

Such a d is learnable，as desired，because it can’t contain any existing 
clauses. (Every sub clause of c’，including d itself，would otherwise have given us 
something to force at a lower level.) We can now discard levels > d 1 of the trail ， 
where d 1 is the maximum level of 61 through 6 r ; and — this is the punch line — 
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we can append I 1 to the end of level d! ^ with d as its reason. The forcing process 
now resumes at level d! ^ as if the learned clause had been present all along. 

For example，after the conflict in ( 113 )，the initial conflict clause is c = 258, 
our shorthand notation for ^2 VX 5 V^g ； and its rightmost complemented literal in 
the trail is 2, because 5 and 8 came earlier. So we resolve c with 246, the reason 
for 2, and get d — 4568. This new clause contains complements of three literals 
from level 3, namely 4, 5, and 8 ; so it’s still a conflict clause. We resolve it with 
the reason for 8 and get d — 456. Again d is a conflict clause. But the result 
of resolving this conflict with the reason for 5 is c 7 = 46, a clause that is falsified 
by the literals currently on the trail but has only 4 at level 3. Good — we have 
learned c 46 ’： In every solution to waerden(3^ 3; 9)，either X 4 or xq must be true. 

Thus the sequel to ( 113 ) is 

t L t level reason 

0 6 1 A (a decision) ( 114 ) 

14 1 46 (the newly learned clause) 

and the next step will be to begin a new level 2 , because nothing more is forced. 

Notice that the former level 2 has gone away. We’ve learned that there was 
no need to branch on the decision variable xq^ because 6 already forces 4. This 
improvement to the usual backtrack regimen is sometimes called “backjumping，” 
because we’ve jumped back to a level that can be regarded as the root cause of 
the conflict that was just discovered. 

Exercise 253 explores a possible continuation of ( 114 ); dear reader，please 
jump to it now. Incidentally, the clause c 46’ that we learned in this example 
involves the complements of former decisions 4 and 6 ; but exercise 255 shows 
that newly learned clauses might not contain any decision variables whatsoever. 

The process of constructing the learned clause from a conflict is not as 
difficult as it may seem，because there’s an efficient way to perform all of the 
necessary resolution steps. Suppose, as above, that the initial conflict clause is 
[Vai V • • • Va/c. Then we “stamp” each of the literals with a unique number s; 
and we also insert ai into an auxiliary array，which will eventually hold the 
literals 61 , …， whenever ai is a literal that received its value on a level d! 
with 0 < < d. We stamp l too; and we count how many literals of level d 

have thereby been stamped. Then we repeatedly go back through the trail until 
coming to a literal L t whose stamp equals s. If the counter is bigger than 1 
at this point, and if the reason of is V V • • • V o! k ,^ we look at each a ;， 
stamping it and possibly putting it into the b array if it had not already been 
stamped with s. Eventually the count of unresolved literals will decrease to 1; 
the learned clause is then V 61 V • • • V 6 r . 

These new clauses might turn out to be quite large, even when we’re solving a 
problem whose clauses were rather small to start with. For example，Table 3 gives 
a glimpse of typical behavior in a medium - size problem. It shows the beginning 
of the trail generated when a CDCL solver was applied to the 2779 clauses of 
waerden(3^ 10; 97), after about 10,000 clauses had been learned. (Recall that 
this problem tries to find a binary vector X 1 X 2 ... ^97 that has no three equally 


backtrack 

backjumping 

look-back, see backjumping 

stamp 

waerden 


September 23, 2015 



7 . 2 . 2.2 


SATISFIABILITY: CLAUSE-LEARNING ALGORITHMS 


65 


Table 3 


THE FIRST LEVELS OF A MODERATE-SIZE TRAIL 


t 

L t 

level 

reason 

t 

L t 

level 

reason 

t 

L t 

level 

reason 

0 

53 

1 

A 

15 

70 

11 

70 36 53 

30 

08 

15 

08 46 27 

1 

55 

2 

A 

16 

35 

12 

A 

31 

65 

15 

65 46 27 

2 

44 

3 

A 

17 

39 

13 

A 

32 

60 

15 

60 46 53 

3 

54 

4 

A 

18 

J7 

14 

A 

33 

50 

15 

氺氺 

4 

43 

5 

A 

19 

38 

14 

38 3736 

34 

64 

15 

64 50 36 

5 

30 

6 

A 

20 

47 

14 

47 3727 

35 

22 

15 

22 50 36 

6 

34 

7 

A 

21 

17 

14 

17 3727 

36 

24 

15 

24 50 37 

7 

45 

8 

A 

22 

32 

14 

323727 

37 

42 

15 

42 50 46 

8 

40 

9 

A 

23 

69 

14 

69 3753 

38 

48 

15 

48 50 46 

9 

27 

10 

A 

24 

21 

14 

213753 

39 

73 

15 

73 50 27 

10 

79 

10 

79 53 27 

25 

46 

15 

A 

40 

04 

15 

04 50 27 

11 

01 

10 

0127 53 

26 

28 

15 

28 46 37 

41 

63 

15 

63 50 37 

12 

M 

11 

A 

27 

41 

15 

4146 36 

42 

33 

16 

A 

13 

18 

11 

18 36 27 

28 

26 

15 

26 46 36 

43 

51 

17 

A 

14 

19 

11 

19 36 53 

29 

56 

15 

56 46 36 

44 

57 

18 

A 


(Here ** stands for the previously learned clause 50 26 47 35 41 32 38 44 27 45 55 65 60 7030.) 


redundant 

unit propagation 

lazy data structures 


spaced Os and no ten equally spaced Is.) Level 18 in the table has just been 
launched with the decision L 44 = 57; and that decision will trigger the setting 
of many more literals 15, 49, 61，68, 77, 78, 87, 96，…， eventually leading to a 
conflict when trying to set L 67 . The conflict clause turns out to have length 22: 

53 27 36 70 35 37 69 2T 46 28 M 65 60 50 64 24 42 73 M 33 5157. ( 115 ) 

(Its literals are shown here in order of the appearance of their complements in 
the trail.) When we see such a monster clause，we might well question whether 
we really want to “learn” such an obscure fact! 

A closer look, however，reveals that many of the literals in ( 115 ) are redun¬ 
dant. For example, 70 can safely be deleted, because its reason is ‘70 36 53’； both 
36 and 53 already appear in ( 115 )，hence ( 115)0 (70 36 53) gets rid of 70. Indeed， 
more than half of the literals in this example are redundant, and ( 115 ) can be 
simplified to the much shorter and more memorable clause 

53 27 36 35 37 46 50 33 5157. ( 116 ) 

Exercise 257 explains how to discover such simplifications, which turn out to 
be quite important in practice. For example, the clauses learned while proving 
waerden(3, 10; 97) unsatisfiable had an average length of 19.9 before simplifica¬ 
tion, but only 11.2 after; simplification made the algorithm run about 33% faster. 

Most of the computation time of a CDCL solver is devoted to unit propa¬ 
gation. Thus we need to know when the value of a literal has been forced by 
previous assignments, and we hope to know it quickly. The idea of “lazy data 
structures，” used above in Algorithm D，works nicely for this purpose，in the pres¬ 
ence of long clauses, provided that we extend it so that every clause now has two 
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watched literals instead of one. If we know that the first two literals of a clause are 
not false, then we needn’t look at this clause until one of them becomes false, even 
though other literals in the clause might be repeatedly veering between transient 
states of true，false，and undefined. And when a watchee does become false，well 
try to swap it with a nonfalse partner that can be watched instead. Propagations 
or conflicts will arise only when all of the remaining literals are false. 

Algorithm C below therefore represents clauses with the following data 
structures: A monolithic array called MEM is assumed to be large enough to 
hold all of the literals in all of the clauses, interspersed with control information. 
Each clause c = Zo V “ V • • • V lk-i with A: > 1 is represented by its starting 
position in MEM ， with MEM [c + j] = lj for 0 < j < k. Its two watched literals are 
Iq and Zi ，and its size k is stored in MEM [c — 1]. Unit clauses, for which k = 1， 
are treated differently; they appear in level 0 of the trail，not in MEM. 

A learned clause c can be distinguished from an initial clause because it has 
a relatively high number, with MINL < c < MAXL. Initially MAXL is set equal to 
MINL ， the smallest cell in MEM that is available for learned clauses; then MAXL 
grows as new clauses are added to the repertoire. The set of learned clauses is 
periodically culled，so that the less desirable ones don’t clutter up memory and 
slow things down. Additional information about a learned clause c is kept in 
MEM [c — 4] and MEM[c — 5] ， to help with this recycling process (see below). 

Individual literals Xk and for 1 < A: < n, are represented internally by 
the numbers 2k and 2^ + 1 as in ( 57 ) above. And each of these 2n literals l has 
a list pointer W/，which begins a linked list of the clauses in which l is watched. 
We have W/ = 0 if there is no such clause; but if W\ = c > 0, the next link in 

this a watch list” is in MEM[c — 2] if l = Iq, in MEM[c — 3] if Z = Zi. [See Armin 

Biere， Journal on Satisfiability^ Boolean Modeling and Comp. 4 (2008)，75-97.] 

For example，the first few cells of MEM might contain the following data when 
we are representing the clauses (g) of waerden(3^ 3; 9): 

i = 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 ... 

MEM [i] 二 9 45 3 2 4 6 15 51 3 4 6 8 21 45 3 6 8 10 … 

(Clause 3 is ‘123’, clause 9 is ‘234’, clause 15 is 4 345’, clause 45 is ‘135’, 
clause 51 is c 246’， …； the watch lists for literals : ri， 奶，$ 3 , $4 begin respectively 

at W 2 =3,W 4 = 3, =9,W 8 = 15.) 

The other major data structures of Algorithm C are focused on variables，not 
clauses. Each variable Xk for 1 < k < n has six current attributes S(k) ， VAL(k)^ 
OVAL (AO ， TL0C(AO, HLOC(fc)，and ACT(fc)，which interact as follows: S(k) is the 
“stamp” that’s used during clause formation. If neither Xk nor Xk appears in 
the current trail, then VAL(^) = — 1， and we say that Xk and its two literals are 
“free •” But if = Hs a literal of the trail，belonging to level d，we have 

VAL (⑷ ）= 2d + (Z & 1) and TL0C(|/|) = t, where ⑷ =Z 》1， ( 117 ) 

and we say that l is “true” and l is “false •” Thus a given literal l is false if and 
only if VAL(|/|) is nonnegative and VAL(|/|) + l is odd. In most cases a watched 
literal is not false; but there are exceptions to this rule (see exercise 261). 
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The attributes ACT (A:) and HL0C(A0 tell the algorithm how to select the 
next decision variable. Each variable Xk has an activity score ACT (AO ， which 
heuristically estimates its desirability for branching. All of the free variables, 
and possibly others, are kept in an array called HEAP, which is arranged so that 

ACT (HEAP [j]) < ACT ( HEAP [(j - 1) > 1] ) for 0 < jf < /i ( 118 ) 

when it contains h elements (see Section 5.2.3). Thus HEAP [0] will always be a 
free variable of maximum activity, if it is free; so it’s the variable that will be 
chosen to govern the decision when the trail starts to acquire a new level. 

Activity scores help the algorithm to focus on recent conflicts. Suppose, 
for example, that 100 conflicts have been resolved, hence 100 clauses have been 
learned. Suppose further that Xj or Xj was stamped while resolving the conflicts 
numbered 3, 47, 95, 99, and 100; but Xk or Xk was stamped during conflicts 41， 
87, 94, 95, 96, and 97. We could express their recent activity by computing 

ACT(j) = p Q + p 1 + p 5 + p 53 + p 97 , ACT(fc) = p 3 + p 4 + p 5 + p 6 + p 13 + p 59 , 

where p is a damping factor (say p = .95), because 100 — 100 = 0, 100 — 99 = 1， 
100 — 95 = 5,…， 100 — 41 = 59. In this particular case j would be considered 
to be less active than k unless p is less than about .8744. 

In order to update the activity scores according to this measure，we would 
have to do quite a bit of recomputation whenever a new conflict occurs: The new 
scores would require us to multiply all n of the old scores by p，then to increase 
the activity of every newly stamped variable by 1. But there’s a much better 
way，namely to compute p -100 times the scores shown above: 

ACT(jf) = p— 3 +p— 47 + p— 95 +/T 99 +p— 100 , ACT(fc) = p 一 41 + ... + p— 96 +p— 97 . 

These newly scaled scores, suggested by Niklas Een, give us the same information 
about the relative activity of each variable; and they’re updated easily，because 
we need to do only one addition per stamped variable when resolving conflicts. 

The only problem is that the new scores can become really huge, because 
p~ M can cause floating point overflow after the number M of conflicts becomes 
large. The remedy is to divide them all by 10 100 , say, whenever any variable gets 
a score that exceeds 10 lo °. The HEAP needn’t change, since ( 118 ) still holds. 

During the algorithm the variable DEL holds the current scaling factor p- M , 
divided by 10 lo ° each time all of the activities have been rescaled. 

Finally，the parity of OVAL (AO is used to control the polarity of each new 
decision in step C 6 . Algorithm C starts by simply making each OVAL (AO odd， 
although other initialization schemes are possible. Afterwards it sets 0VAL(A:) 4- 
VAL(A0 whenever Xk leaves the trail and becomes free, as recommended by 

D. Frost and R. Dechter [AAAI Coni. 12 (1994), 301-306] and independently 
by K. Pipatsrisawat and A. Darwiche [LNCS 4501 (2007), 294-299], because 
experience has shown that the recently forced polarities tend to remain good. 
This technique is called “sticking” or “progress saving” or “phase saving.” 

Algorithm C is based on the framework of a pioneering CDCL solver called 
Chaff, and on an early descendant of Chaff called MiniSAT that was developed 
by N. Een and N. Sorensson [LNCS 2919 (2004)，502-518]. 
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Algorithm C (Satisfiability by CD CL). Given a set of clauses on n Boolean 
variables, this algorithm finds a solution L 0 Li ... L n _i if and only if the clauses 
are satisfiable，meanwhile discovering M new ones that are consequences of the 
originals. After discovering M p new clauses, it will purge some of them from its 
memory and reset M p ; after discovering Mf of them, it will flush part of its trail ， 
reset Mf ， and start over. (Details of purging and flushing will be discussed later.) 

Cl. [Initialize.] Set VAL(fc) ^ OVAL(fc) 4-TLOC(fc) ^-1, ACT(fc) ^-S(fc) —0, 
R 2 k 丑 2/c+i t A ， HL0C(AO pk — 1 ， and HEAP [p^ — 1] 4 - for 1 < A: < n, 
where pi .. .p n is a random permutation of {1 ， … ， n}. Then input the 
clauses into MEM and the watch lists，as described above. Put the distinct unit 
clauses into LqLi ... Lf-i] but terminate unsuccessfully if there are contra¬ 
dictory clauses (l) and (I). Set MINL and MAXL to the first available position 
in MEM. (See exercise 260.) Set io^-d^s^-M^-G^-0, hm, DEL 11. 

C2. [Level complete?] (The trail Lq … Lf-i now contains all of the literals that 
are forced by L 0 … Lq-i) Go to C5 if G = F. 

C3. [Advance G] Set l 4 - Lq and G ^ G + 1. Then do step C4 for all c in the 
watch list of 『， unless that step detects a conflict and jumps to C7. If there 
is no conflict, return to C2. (See exercise 261.) 

C4. [Does c force a unit?] Let Wi … h-i be the literals of clause c，where h =1. 
(Swap /o h if necessary.) If Iq is true, do nothing. Otherwise look for a 
literal lj with 1 < j < k that is not false. If such a literal is found, move c 
to the watch list of lj. But if k ， … ， h-i are all false，jump to C7 if lo is 
also false. On the other hand if lo is free，make it true by setting Lf Iq, 
TLOC(J/oJ) i — F ， VAL(|/qJ) i — 2<i + {7o & 1), Ri 0 、 ~ c, &nci F i — F 

C5. [New level?] If F =n，terminate successfully. Otherwise if M > M p ， pre¬ 
pare to purge excess clauses (see below). Otherwise if M > Mf，flush literals 
as explained below and return to C2. Otherwise set d d+1 and id <- F. 

C6. [Make a decision.] Set k 4 - HEAP [0] and delete k from the heap (see exercises 
262 and 266). If VAL(fc) > 0, repeat this step. Otherwise set / + 

(OVAL(fc) & 1), VAL(fc) ^ 2d (OVAL(fc) & 1), L F ^ /, TL0C(|；|) ^ F, 
Ri A, and F F + 1. (At this point F = G + 1.) Go to C3. 

C7. [Resolve a conflict.] Terminate unsuccessfully if d = 0. Otherwise use the 
conflict clause c to construct a new clause Pv6i V - • • V6 r as described above. 
Set ACT(|/|) ^ ACT(|/|) + DEL for all literals l stamped during this process; 
also set d! to the maximum level occupied by {61 ， …， b r } in the trail. (See 
exercise 263. Increasing ACT(|/|) may also change HEAP.) 

C8. [Backjump.] While F > do the following: Set F F — 1, l Lp, 
k ^ \l\, OVAL(fc) VAL(fc), VAL(fc) ^ —— 1, R t A; and if HL0C(|；|) < 0 
insert k into HEAP (see exercise 262). Then set G <- F and d ^ d!. 

C9. [Learn.] If d > 0, set c MAXL, store the new clause in MEM at position c ， 
and advance MAXL to the next available position in MEM. (Exercise 263 gives 
full details.) Set Af" i — M - Lp i — l 1 , TLOC( | 厂 |) ^~ F ， Ri' i — c, F i — F+1, 

DEL 卜 DEL/p, and return to C3. | 
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The high-level operations on data structures in this algorithm are spelled out 
in terms of elementary low-level steps in exercises 260-263. Exercises 266-271 
discuss simple enhancements that were made in the experiments reported below. 

Reality check: Although detailed statistics about the performance of Algo¬ 
rithm C on a wide variety of problems will be presented later, a few examples of 
typical behavior will help now to clarify how the method actually works in prac¬ 
tice. Random choices make the running time of this algorithm more variable than 
it was in Algorithms A, B ， D，or L; sometimes we’re lucky, sometimes we’re not. 

In the case of waerden(3^ 10; 97), the modest 97-variable-and - 2779 - clause 
problem that was considered in Table 3, nine test runs of Algorithm C established 
unsatisfiability after making between 250 and 300 million memory accesses; the 
median was 272 Mfi. (This is more than twice as fast as our best previous time ， 
which was obtained with Algorithm L.) The average number of decisions made — 
namely the number of times Lp I was done in step C6 — was about 63 thou¬ 
sand; this compares to 1701 “nodes” in Algorithm L，step L3, and 100 million 
nodes in Algorithms A, B, D. About 53 thousand clauses were learned, having 
an average size of 11.5 literals (after averaging about 19.9 before simplification). 

Fig. 49. It is not 
possible to color the 
edges of the flower 
snark graph J q with 
three colors, when q 
is odd. Algorithm C 
is able to prove this 
with amazing speed: 

Computation times 
(in megamems) are 
shown for nine trials 
at each value of q. 

Algorithm C often speeds things up much more dramatically, in fact. For 
example ， Fig. 49 shows how it whips through a sequence of three-coloring prob¬ 
lems that are based on “flower snarks.” Exercise 176 defines fsnark(q)^ an 
interesting set of 42q + 3 unsatisfiable clauses on 18q variables. The running 
time of Algorithms A ， B ， D，and L on fsnark(q) is proportional to 2 g ，so it’s 
way off the chart —— well over a gigamem already when q = 19. But Algorithm C 
polishes off the case g = 99 in that same amount of time (thus winning by 24 
orders of magnitude)! On the other hand, no satisfactory theoretical explanation 
for the apparently linear behavior in Fig. 49 is presently known. 

Certificates of unsatisfiability. When a SAT solver reports that a given 
instance is satisfiable, it also produces a set of distinct literals from which we can 
easily check that every clause is satisfied. But if its report is negative — UNSAT — 
how confident can we be that such a claim is true? Maybe the implementation 
contains a subtle error; after all，large and complicated programs are notoriously 
buggy，and computer hardware isn’t perfect either. A negative answer can there¬ 
fore leave both programmers and users unsatisfied, as well as the problem. 
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WeVe seen that unsatisfiability can be proved rigorously by constructing 
a resolution refutation, namely a chain of resolution steps that ends with the 
empty clause e，as in Fig. 48. But such refutations amount to the construction 
of a huge directed acyclic graph. 

A much more compact characterization of unsatisfiability is possible. Let’s 
say that the sequence of clauses (Ci ， C 2 , • • • ， Ct) is a certificate of unsatisfiability 
for a family of clauses F if C t = 6 , and if we have 

F A (7i A • • • A Ci-i ACi hi e for 1 < i < t. ( 119 ) 

Here the subscript 1 in C G hi e 5 means that the clauses G lead to a contradiction 
by unit propagation] and if C{ is the clause (a\ V ••• V a/c)，then C{ is an 
abbreviation for the conjunction of unit clauses ( 知） 八 … 八 (a/c). 

For example ， let F = R be Rivest’s clauses ( 6 )，which were proved unsatis- 
fiable in Fig. 48. Then (12, 1 ， 2, e) is a certificate of unsatisfiability，because 


iZAl A2hi 3 hi 4hi 6 (using 123, 234, and 341); 

E A 12 A I K 2hi 4 hi 3 hi e (using 12, 412, 234, and 341); 
.RA 12 A 1 A2 hi 4 K 3 h € (using 4l2, 234, and 34l); 


i?A 12 A 1 A2 hi 3 hi 4 hi e (using 123, 234, and 34l). 

A certificate of unsatisfiability gives a convincing proof，since (ng) implies 
that each C{ must be true whenever F) Ci” • • ， C{-\ are true. And it’s easy to 
check whether or not G hi 6 , for any given set of clauses because everything is 
forced and no choices are involved. Unit propagation is analogous to water flow¬ 
ing downhill; we can be pretty sure that it has been implemented correctly，even 
if we don’t trust the CDCL solver that generated the certificate being checked. 

E. Goldberg and Y. Novikov [Proceedings of DATE: Design，Automation 
and Test in Europe 6,1 (2003), 886-891] have pointed out that CDCL solvers 

actually produce such certificates as a natural byproduct of their operation: 


Theorem G. If Algorithm C terminates unsuccessfully，the sequence 
• • • ， C t ) of clauses that it has learned is a certificate of unsatisfiability. 


Proof. It suffices to show that, whenever Algorithm C has learned the clause 
C’ = P V6i V - • • V6 r , unit propagation will deduce e if we append the unit clauses 
(l 1 ) 八 （ 6 i) 八…八 (b r ) to the clauses that the algorithm already knows. The key 
point is that C f has essentially been obtained by repeated resolution steps ， 

C 1 = (…((心也），)…..），， (120) 

where C is the original conflict clause and Ri” Ri” •… ， R“ are the reasons 
for each literal that was removed while C f was constructed in step C7. More 
precisely，we have C = Aq and Ri { = where all literals of AqUAiU^ -UA $ 

are false (their complements appear in the trail); and 


^ G Aq U • • • U 為 _i ， for 1 < i < s ; 

Aq U A\ U • • • U A s = {I’ ， li, • • • ， l s , b\” • • ， b r ) 


( 121 ) 


Thus the known clauses，plus 61 ， … ， 6 r ，and V^ will force l s using clause Ri 
And l s -i will then be forced，using Ri s _ 1 . And so on. | 
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Since the unit literals in this proof are propagated in reverse order l s ^ l s _u 
， h from the resolution steps in ( 120 )，this certificate-checking procedure has 
become known as “reverse unit propagation” [see A. Van Gelder ， Proc. Int. Symp. 
on Artificial Intelligence and Math. 10 (200^), 9 pages, online as ISAIM2008]. 

Notice that the proof of Theorem G doesn’t claim that reverse unit prop¬ 
agation will reconstruct the precise reasoning by which Algorithm C learned a 
clause. Many different downhill paths to 6, built from hi steps, usually exist in 
a typical situation. We merely have shown that every clause learnable from a 
single conflict does imply the existence of at least one such downhill path. 

Many of the clauses learned during a typical run of Algorithm C will be 
“shots in the dark，” which turn out to have been aimed in unfruitful directions. 
Thus the certificates in Theorem G will usually be longer than actually nec¬ 
essary to demonstrate unsatisfiability. For example，Algorithm C learns about 
53,000 clauses when refuting waerden(3, 10; 97)，and about 135,000 when refuting 
fsnark(99)] but fewer than 50,000 of the former，and fewer than 47,000 of the 
latter，were actually used in subsequent steps. Exercise 284 explains how to 
shorten a certificate of unsatisfiability while checking its validity. 

An unexpected difficulty arises ， however: We might spend more time veri¬ 
fying a certificate than we needed to generate it! For example，a certificate for 
waerden(3, 10; 97) was found in 272 megamems，but the time needed to check it 
with straightforward unit-propagations was actually 2.2 ^amems. Indeed, this 
discrepancy becomes significantly worse in larger problems，because a simple 
program for checking must keep all of the clauses active in its memory. If there 
are a million active clauses, there are two million literals being watched; hence 
every change to a literal will require many updates to the data structures. 

The solution to this problem is to provide extra hints to the certificate 
checker. As we are about to see，Algorithm C does not keep all of the learned 
clauses in its memory; it systematically purges its collection, so that the total 
number stays reasonable. At such times it can also inform the certificate checker 
that the purged clauses will no longer be relevant to the proof. 

Further improvements also allow annotated certificates to accommodate 
stronger proof rules, such as Tseytin’s extended resolution and techniques based 
on generalized autarkies; see N. Wetzler ， M. J. H. Heule，and W. A. Hunt ， Jr .， 
LNCS 8561 (2014), 422-429. 

Whenever a family of clauses has a certificate of unsatisfiability, a variant of 
Algorithm C will actually find one that isn’t too much longer. (See exercise 386.) 

^Purging unhelpful clauses. After thousands of conflicts have occurred, Algo¬ 
rithm C has learned thousands of new clauses. New clauses guide the search 
by steering us away from unproductive paths; but they also slow down the 
propagation process，because we have to watch them. 

We’ve seen that certificates can usually be shortened; therefore we know 
that many of the learned clauses will probably never be needed again. For this 
reason Algorithm C periodically attempts to weed out the ones that appear to 
be more harmful than helpful, by ranking the clauses that have accumulated. 
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I consider that a man's brain originally is like a little empty attic, and 

you have to stock it with such furniture as you choose.the skilled workman 

is very careful indeed as to what he takes into his brain-attic. 

... It is a mistake to think that that little room has elastic walls 
and can distend to any extent. … It is of the highest importance, therefore, 

not to have useless facts elbowing out the useful ones. 

— SHERLOCK HOLMES, in 4 Study in Scarlet (1887) 

Algorithm C initiates a special clause-refinement process as soon as it has 
learned M > M p clauses and arrived at a reasonably stable state (step C5). 
Let’s continue our running example ， 臟 erden(3,10; 97)，in order to make the 
issues concrete. If M p is so huge that no clauses are ever thrown away，a typical 
run will learn roughly 48 thousand clauses, and do roughly 800 megamems of 
computation, before proving unsatisfiability. But if M p = 10000, it will learn 
roughly 50 thousand clauses, and the computation time will go down to about 
500 megamems. In the latter case the total number of learned clauses in memory 
will rarely exceed 10 thousand. 

Indeed ， let’s set M p = 10000 and take a close look at exactly what happened 
during the author’s first experiments. Algorithm C paused to reconnoiter the 
situation after having learned 10002 clauses. At that point only 6252 of those 
10002 clauses were actually present in memory ， however，because of the clause¬ 
discarding mechanism discussed in exercise 271. Some clauses had length 2, while 
the maximum size was 24 and the median was 11; here’s a complete histogram: 

2 9 49 126 216 371 542 719 882 1094 661 540 414 269 176 111 35 20 10 3 1 1 1. 

Short clauses tend to be more useful, because they reduce more quickly to units. 

A learned clause cannot be purged if it is the reason for one of the literals 
on the trail. In our example, 12 of the 6252 fell into this category; for instance ， 
30 appeared on level 10 of the trail because ‘30 33 39 41 42 45 46 48 54 57’ had 

been learned，and we may need to know that clause in a future resolution step. 

The purging process will try to remove at least half of the existing learned 
clauses, so that at most 3126 remain. We aren’t allowed to touch the 12 reason- 
bound ones; hence we want to forget 3114 of the other 6240. Which of them 
should we expel? 

Among many heuristics that have been tried，the most successful in practice 
are based on what Gilles Audemard and Laurent Simon have called “literal block 
distance” [see Proc. Int. Joint Conference on Artificial Intelligence 21 (2009 )， 
399-404]. They observed that each level of the trail can be considered to be a 
block of more - or-less related variables; hence a long clause might turn out to be 
more useful than a short clause, if the literals of the long one all lie on just one 
or two levels while the literals of the short one belong to three or more. 

Suppose all the literals of a clause C = “ V • • • V ~ appear in the trail，either 
positively as lj or negatively as lj. We can group them by level so that exactly 
p + q levels are represented，where p of the levels contain at least one positive lj 
and the other q contain nothing but 『 /s. Then (p^q) is the signature of C with 
respect to the trail，and p + g is the literal block distance. For example，the very 
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/ 0 4 17 22 30 54 67 99 17 \ 

17 81 191 395 360 404 438 66 6 ' 

63 232 463 536 521 386 117 6 0 

52 243 291 298 308 112 22 0 0 

18 59 86 77 53 7 0 0 0 

083 10 00000 

Vooioooooo/ 


The matrix on the left shows how many of the 6240 eligible clauses had a given 
signature (p ， g), for 1 < p < 7 and 0 < g < 8 ; the matrix on the right shows how 
many would have been used to resolve future conflicts, if none of them had been 
removed. There were, for example, 536 learned clauses with p = g = 3, of which 
only 86 actually turned out to be useful. This data is illustrated graphically in 
Fig. 50, which shows gray rectangles whose areas correspond to the left matrix ， 
overlaid by black rectangles whose areas correspond to the right matrix. We can’t 
predict the future, but small (p ， g) tends to increase the ratio of black to gray. 
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Fig- 50. Learned clauses that have 
p positive and q all-negative levels. 
The gray ones will never be used 
again. Unfortunately 5 there’s no easy 
way to distinguish gray from black 
without being clairvoyant- 


An alert reader will be wondering ， however，how such signatures were found ， 
because we can’t compute them for all clauses until all variables appear in the 
trail — and that doesn’t happen until all clauses are satisfied! The answer [see 
A. Goultiaeva and F. Bacchus, LNCS 7317 (2012), 30-43] is that it’s quite 

possible to carry out a “full run” in which every variable is assigned a value, 
by making only a slight change to the normal behavior of Algorithm C: Instead 


first clause learned from waerden(3, 10; 97) in the author’s test run was VAL 

Goultiaeva 

TT16 2T 26 M 46 5T 61 66 91; ( 122 ) full run 

later, when it was time to rank clauses for purging, the values and trail levels of 
those literals were specified by VAL (11 )， VAL (16 )， …， VAL (91 )， which were 

20 21 21 21 20 15 16 8 14 20. 

Thus 61 was true on level 8 》 1 = 4; 46 and 66 were true on level 15 》 1 = 

14 》 1 = 7; 51 was false on level 8; the others were a mixture of true and false 
on level 10; hence ( 122 ) had p = 3 and q = 1 with respect to the current trail. 

If C has signature (p, q) and C 1 has signature (p’ ， g’)，where p <p' and q < q' 
and (p ， g) / (p 1 ， g’)，we can expect that C is more likely than C 1 to be useful in 
future propagations. The same conclusion is plausible also when p + g = 
and p < p 1 , because C r won’t force anything until literals from at least p + 1 
different levels change sign. These intuitive expectations are borne out by the 
following detailed data obtained from waerden(3, 10; 97): 
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of resolving conflicts immediately and backjumping，we can carry on after each 
conflict until all propagations cease, and we can continue to build the trail in 
the same way until every variable is present on some level. Conflicts may have 
occurred on several different levels; but we can safely resolve them later, learning 
new clauses at that time. Meanwhile, a full trail allows us to compute signatures 
based on VAL fields. And those VAL fields go into the OVAL fields after backjump- 
ing，so the variables in each block will tend to maintain their relationships. 

The author’s implementation of Algorithm C assigns an eight-bit value 

RANGE(c) •<— min ([16(p -1- aq )\, 255) ( 123 ) 

to each clause c; here a is a parameter, 0 < a < 1. We also set RANGE (c) ^ 0 
if c is the reason for some literal in the trail; RANGE(c) 256 if c is satisfied 

at level 0. If there are rrij clauses of range jf, and if we want to keep at most T 
clauses in memory，we find the largest j < 256 such that 

rrij > 0 and Sj = mo + mi + ••• + Trij-i < T. ( 124 ) 

Then we retain all clauses for which RANGE (c) < j, together with T — Sj u tie- 
breakers” that have RANGE (c) = j (unless j = 256). When a has the relatively 
high value = .9375, this rule essentially preserves as many clauses of small 
literal block distance as it can; and for constant p + g it favors those with small p. 

For example, with a = and the data from Fig. 50, we save clauses that 
have p = (1 ， 2,3,4,5) when q < (5, 4,3, 2,0) ， respectively. This gives us S 95 = 
12 + 3069 clauses, just 45 shy of our target T = 3126. So we also choose 45 
tie-breakers from among the 59 clauses that have RANGE (c) = 95, (p^q) = (5,1). 

Tie-breaking can be done by using a secondary heuristic ACT(c)，“clause 
activity^ which is analogous to the activity score of a variable but it is more 
easily maintained. If clause c has been used to resolve the conflicts numbered 3, 
47, 95, 99, and 100, say, then 

ACT(c) = 3 + p — 47 + p 一 95 +p 一 99 + p— 100 . ( 125 ) 

This damping factor g (normally .999) is independent of the factor p that is used 
for variable activities. In the case of Fig. 50, if the 59 clauses with (p^ q) = (5,1) 
are arranged in order of increasing ACT scores, the gray - and-black pattern is 

■■■■■■■■■■■■■■■■■■■■■■■■■■■■■■■■■■■■■■■■■■■■■■■■■■■■■■■■■■a. 

So if we retain the 45 with highest activity, we pick up 8 of the 10 that turn out 
to be useful. (Clause activities are imperfect predictors, but they are usually 
somewhat better than this example implies.) 

Exercises 287 and 288 present full details of clause purging in accordance 
with these ideas. One question remains: After we’ve completed a purge, when 
should we schedule the next one? Successful results are obtained by having two 
parameters ， A p and 6 P . Initially M p = A p ; then after each purge，we set A p ^― 
A p +(5 P and M p i- M p + A p . For example，if A p = 10000 and 6 P = 100, purging 

will occur after approximately 10000, 20100, 30300, 40600, …， kA p + O p , 
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… clauses have been learned; and the number of clauses at the beginning of the 
kth. round will be approximately 2 ⑻ 00 + 200k — 2A P + 2k6 p . (See exercise 289.) 

We’ve based this discussion on waerden(3^ 10; 97), which is quite a simple 
problem. Algorithm C’s gain from clause-purging on larger problems is naturally 
much more substantial. For example ， waerden(3, 13; 160) is only a bit larger than 
waerden(3, 10; 97). With A p 二 10000 and S p = 100, it finishes in 132 gigamems, 
after learning 9.5 million clauses and occupying only 503 thousand MEM cells. 
Without purging, it proves unsatisfiability after learning only 7.1 million clauses, 
yet at well over ten times the cost: 4307 gigamems，and 102 million cells of MEM. 

*Flushing literals and restarting. Algorithm C interrupts itself in step C5 not 
only to purge clauses but also to “flush literals” that may not have been the best 
choices for decisions in the trail. The task of solving a tough satisfiability problem 
is a delicate balancing act: We don’t want to get bogged down in the wrong part 
of the search space; but we also don’t want to lose the fruits of hard work by 
“throwing out the baby with the bath water.” A nice compromise has been found 
by Peter van der Tak，Antonio Ramos, and Marijn Heule [J. Satisfiability ， Bool. 
Modeling and Comp. 7 (2011) ， 133—138]，who devised a useful way to rejuvenate 
the trail periodically by following trends in the activity scores ACT (AO. 

Let’s go back to Table 3， to illustrate their method. After learning the 
clause ( 116 )，Algorithm C will update the trail by setting L 44 ^ 57 on level 
17; that will force L 45 ^ 66 , because 39, 42，…， 63 have all become true; and 
further positive literals 6 , 58, 82, 86 , 95, 96 will also join the trail in some order. 
Step C5 might then intervene to suggest that we should contemplate flushing 
some or all of the F = 52 literals whose values are currently assigned. 

The decision literals 53 ， 55, 44，… ” 51 on levels 1 ， 2, 3，… ” 17 each were 
selected because they had the greatest current activity scores when their level be¬ 
gan. But activity scores are continually being updated, so the old ones might be 
considerably out of touch with present realities. For example ， we’ve just boosted 
ACT (53), ACT (27), ACT (36), ACT (70), , in the process of learning ( 116 ) — see 

( 115 ). Thus it’s quite possible that several of the first 17 decisions no longer 
seem wise, because those literals haven’t participated in any recent conflicts. 

Let Xk be a variable with maximum ACT (AO ， among all of the variables not 
in the current trail. It’s easy to find such a k (see exercise 290). Now consider ， 
as a thought experiment, what would happen if we were to jump back all the 
way to level 0 at this point and start over. Recall that our phase-saving strategy 

dictates that we would set 0VAL(j) -f- VAL(j) just before setting VAL(j) < -1 ， 

as the variables become unassigned. 

If we now restart at step C 6 with d ^ 1， all variables whose activity exceeds 
ACT (AO will receive their former values (although not necessarily in the same 
order), because the corresponding literals will enter the trail either as decisions 
or as forced propagations. History will more or less repeat itself, because the old 
assignments did not cause any conflicts, and because phases were saved. 

We might as well therefore avoid most of this back-and - forth unsetting and 
resetting，by reusing the trail and jumping back only partway，to the first level 
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where the current activity scores significantly change the picture: 

Set d! 0. While ACT(|L^ /+ 1 1) > ACT(fc), set d’ <- d’ + 1. ( 126 ) 

Then if d! < jump back to level d!. 

This is the technique called “literal flushing，” because it removes the literals on 
levels d’ + 1 through d and leaves the others assigned. It effectively redirects the 
search into new territory，without being as drastic as a full restart. 

In Table 3, for example ， ACT(49) might exceed the activity score of every 
other unassigned variable; and it might also exceed ACT(46)，the activity of 
the decision literal 46 on level 15. If the previous 14 decision-oriented activities 
ACT (53), ACT (55), , ACT (37) are all > ACT (49), we would flush all the literals 
[ 25， [‘ 26 ， … above level d! = 14, and commence a new level 15. 

Notice that some of the flushed literals other than 46 might actually have 
the largest activities of all. In such cases they will re-insert themselves, before 
49 ever enters the scene. Eventually, though, the literal 49 will inaugurate a new 
level before a new conflict arises. (See exercise 291.) 

Experience shows that flushing can indeed be extremely helpful. On the 
other hand, it can be harmful if it causes us to abandon a fruitful line of attack. 
When the solver is perking along and learning useful clauses by the dozen，we 
don’t want to upset the applecart by rocking the boat. Armin Biere has therefore 
introduced a useful statistic called agility, which tends to be correlated with the 
desirability of flushing at any given moment. His idea [LNCS 4996 (2008), 28- 
33] is beautifully simple: We maintain a 32-bit integer variable called AGILITY ， 
initially zero. Whenever a literal l is placed on the trail in steps C4, C 6 , or C9, 
we update the agility by setting 

AGILITY AGILITY—(AGILITY 》 13) + (((0VAL(|Z|)—VAL(|Z|))&1) 《 19). ( 127 ) 

In other words，the fraction AGILITY/2 32 is essentially multiplied by 1 — then 
increased by 8 if the new polarity of l differs from its previous polarity, where 
6 = 2 — 13 w .0001. High agility means that lots of the recent propagations are 
flipping the values of variables and trying new possibilities; low agility means 
that the algorithm is basically in a rut，spinning its wheels and getting nowhere. 

Table 4 

TO FLUSH OR NOT TO FLUSH? 


Let a = AGILITY/2 32 when setting Mf M + Af, and let ^ = 1/6, 6 — 17/16- 


If Af is 

then flush if 

If A f is 

then flush if 

If A f is 

then flush if 

1 

a < i/j ^ .17 

32 

a<e 5 ip^ .23 

1024 

a < O 10 if; « .31 

2 

a < ^ .18 

64 

a<0 6 ^ .24 

2048 

a < O 11 ^ ^ .32 

4 

a < 6 2 ^ ^ .19 

128 

a<0 7 ip^ .25 

4096 

a < 6 12 ^ « .34 

8 

a < O 3 ^ ^ .20 

256 

a<0 8 xp^ .27 

8192 

a < 0 ls ip « .37 

16 

a < ^ -21 

512 

a < « .29 

16384 

a < 6< 14 ^ - -39 
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Armed with the notion of agility，we can finally state what Algorithm C 
does when step C5 finds M > Mf ： First Mf is reset to M + Af ， where Af is 
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a power of two determined by the “reluctant doubling” sequence 〈 1 ， 1 ， 2 ， 1 ， 1 ， 
2 ， 4 ， 1，…〉； that sequence is discussed below and in exercise 293. Then the 
agility is compared to a threshold, depending on Af ， according to the schedule 
in Table 4. (The parameter ^ in that table can be raised or lowered，if you want 
to increase or decrease the amount of flushing.) If the agility is sufficiently small ， 
Xk is found and ( 126 ) is performed. Nothing changes if the agility is large or if 
d! = d\ otherwise ( 126 ) has flushed some literals，using the operations of step C 8 . 

Monte Carlo methods. Let’s turn now to a completely different way to 
approach satisfiability problems, based on finding solutions by totally heuristic 
and randomized methods, often called stochastic local search. We often use such 
methods in our daily lives, even though there’s no guarantee of success. The 
simplest satisfiability-oriented technique of this kind was introduced by Jun Gu 
[see SIGART Bulletin 3,1 (January 1992), 8-12] and by Christos Papadimitriou 
[FOCS 32 (1991), 163-169] as a byproduct of more general studies: 

“Start with any truth assignment. While there are unsatisfied 
clauses, pick any one, and flip a random literal in it •” 

Some programmers are known to debug their code in a haphazard manner, 
somewhat like this approach; and we know that such “blind” changes are foolish 
because they usually introduce new bugs. Yet this idea does have merit when it 
is applied to satisfiability，so we shall formulate it as an algorithm: 

Algorithm P (Satisfiability by random walk). Given m nonempty clauses 
(7i 八…八 C m on n Boolean variables x\ .. .x n ^ this algorithm either finds a 
solution or terminates unsuccessfully after making N trials. 

PI. [Initialize.] Assign random Boolean values to xi .. .x n . Set i 0, s I 0, 
and t ^ 0. (We know that s clauses are satisfied after having made t flips.) 

P2. [Success?] If s = m, terminate successfully with solution xi .. .x n . Other¬ 
wise set j ^r- (j mod m) + l. If clause Cj is satisfied by x\ .. .x n ^ set s 4 - s + 1 
and repeat this step. 

P3. [Done?] li t = N ， terminate unsuccessfully. 

P4. [Flip one bit.] Let clause Cj be (Zi V … V 4). Choose a random index 
i G {1,... ，众 }，and change variable |^| so that literal li becomes true. Set 
s e 1， f f + 1， and return to P2. | 

Suppose, for example，that we’re given the seven clauses R 1 of ( 7 ). Thus 
m = 7, n = 4; and there are two solutions ， 01*1. In this case every nonsolution 
violates a unique clause; for example, 1100 violates the clause 123, so step P4 is 
equally likely to change 1100 to 0100 , 1000 , or 1110 , only one of which is closer 
to a solution. An exact analysis (see exercise 294) shows that Algorithm P will 
find a solution after making 8.25 flips, on the average. That’s no improvement 
over a brute-force search through all 2 n = 16 possibilities; but a small example 
like this doesn’t tell us much about what happens when n is large. 

Papadimitriou observed that Algorithm P is reasonably effective when it’s 
applied to 2SAT problems, because each flip has roughly a 50-50 chance of making 
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progress in that case. Several years later，Uwe Schoning [Algorithmic^ 32 (2002)， 
615-623] discovered that the algorithm also does surprisingly well on instances of 
3SAT，even though the flips when A: > 2 in step P4 tend to go “the wrong way”: 


Theorem U. If the given clauses are satisHable，and if each clause has at 
most three literals，Algorithm P will succeed with probability H((3/4) n /n) after 
making at most n flips. 

Proof. By complementing variables, if necessary, we can assume that 0 … 0 is 
a solution; under this assumption, every clause has at least one negative literal. 
Let X t = xi + - • + x n be the number of Is after t flips have been made. Each 
flip changes by 士1， and we want to show that there’s a nontrivial chance that 
X t will become 0. After step Pl，the random variable Xq will be equal to q with 
probability (口 )/2 n . 

A clause that contains three negative literals is good news for Algorithm P， 
because it is violated only when all three variables are 1; a flip will always 
decrease X t in such a case. Similarly，a violated clause with two negatives 
and one positive will invoke a flip that makes progress 2/3 of the time. The 
worst case occurs only when a problematic clause has only one negative literal. 
Unfortunately，every clause might belong to this worst case, for all we know. 

Instead of studying X t , which depends on the pattern of clauses, it’s much 
easier to study another random variable Y t defined as follows: Initially Yq = X 0 ; 
but Y t+ i = Y t — 1 only when step P4 flips the negative literal that has the 
smallest subscript; otherwise Y t+ i =1^ + 1. For example，after taking care of a 
violated clause such as we have X t+ i = X t + (+l，一1，—1) but F^+i = 

+ (+1，一 1， +1) in the three possible cases. Furthermore, if the clause contains 
fewer than three literals, we penalize Y t+ i even more，by allowing it to be Y t — 1 
only with probability 1/3. (After a clause such as X 4 W for instance, we put 
Y t+ i = Y t — lm only 2/3 of the cases when xq is flipped; otherwise Y t+ i = Y t + 1.) 

We clearly have X t < Y t for all t. Therefore Pr(X^ = 0) > Pr(l^ = 0)， 
after t flips have been made; and we’ve defined things so that it’s quite easy to 
calculate Pr(Y t = 0)，because Y t doesn’t depend on the current clause j : 

Pr(y t+ 1 =Y t -l) = l/3 and Pr(F t+ i =Y t + l) = 2/3 when Y t > 0. 


Indeed，the theory of random walks developed in Section 7.2.1.6 tells us how to 
count the number of scenarios that begin with Yq = q and end with Y t = 0, after 
Yt has increased p times and decreased p + q times while remaining positive for 
0 < t < 2p q. It is the “ballot number” of Eq. 7.2.1. 6 -( 23 ), 

^p+o - 1 = 2^C P p 9 )' (l28) 

The probability that Yq = q and that Y t = 0 for the first time when t = + q 

is therefore exactly 
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Every value of p and q gives a lower bound for the probability that Algorithm P 
succeeds; and exercise 296 shows that we get the result claimed in Theorem U 
by choosing p = q ^ n/3. | 

Theorem U might seem pointless，because it predicts success only with 
exponentially small probability when N = n. But if at first we don’t succeed，we 
can try and try again, by repeating Algorithm P with different random choices. 
And if we repeat it Kn(4/3) n times，for large enough K ， we’re almost certain to 
find a solution unless the clauses can’t all be satisfied. 

In fact，even more is true, because the proof of Theorem U doesn’t exploit the 
full power of Eq. ( 129 ). Exercise 297 carries the analysis further, in a particularly 
instructive way, and proves a much sharper result: 

Corollary W. When Algorithm P is applied K(4/3) n times with N = 2n to a 
set of satisfiable ternary clauses，its success probability exceeds 1 — e~ K ! 2 . | 

If the clauses (7i 八…八 C m are unsatisfiable, Algorithm P will never demon¬ 
strate that fact conclusively. But if we repeat it 100(4/3) n times and get no 
solution, Corollary W tells us that the chances of satisfiability are incredibly 
small (less than 10 -21 ). So it’s a safe bet that no solution exists in such a case. 

Thus Algorithm P has a surprisingly good chance of finding solutions “with 
its eyes dosed，” while walking at random in the gigantic space of all 2 n binary 
vectors; and we can well imagine that even better results are possible if we devise 
randomized walking methods that proceed with eyes wide open. Therefore many 
people have experimented with strategies that try to make intelligent choices 
about which direction to take at each flip-step. One of the simplest and best of 
these improvements, popularly known as Walks AT, was devised by B. Seim an 5 

H. A. Kautz, and B. Cohen [Nat. Coni. Artificial Intelligence 12 (1994), 337-343]: 

Algorithm W (WalkSAT). Given m nonempty clauses (7i A • • • A C m on n 
Boolean variables x\ • • .x n , and a “greed - avoidance” parameterthis algorithm 
either finds a solution or terminates unsuccessfully after making N trials. It uses 
auxiliary arrays ci • • • c n ， /o • • • /m-i ， ... k m , and w\ ... w m . 

Wl. [Initialize.] Assign random Boolean values to xi .. .x n . Also set r 4- t ^ 0 
and ci ... c n 0... 0. Then, for 1 < j < m, set kj to the number of true 
literals in Cj] and if kj = 0 , set f r <- Wj t r, and r ^ r + 1 ; or if 

kj = 1 and the only true literal of Cj is Xi or 元 “ set q ^ q + 1. (Now r is 
the number of unsatisfied clauses，and the / array lists them. The number 
Ci is the “cost” or “break count” for variable namely the number of 
additional clauses that will become false if Xi is flipped.) 

W2. [Done?] If r = 0, terminate successfully with solution x\ .. .x n . Otherwise ， 
if t = TV, terminate unsuccessfully. 

W3. [Choose j] Set j i- f q , where q is uniformly random in {0, 1，…， r — 1}. 
(In other words, choose an unsatisfied clause Cj at random, considering 
every such clause to be equally likely; exercise 3.4.1-3 discusses the best 
way to compute q.) Let clause Cj be (L V … V 4). 
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W4. [Choose L] Let c be the smallest cost among the literals {/]_，•••，“}• If 
c = 0, or if c > 1 and U > p where U is uniform in [0 • • 1) ， choose l randomly 
from among the literals of cost c. (We call this a “greedy” choice, because 
flipping l will minimize the number of newly false clauses.) Otherwise 
choose l randomly in {7i， … ， 4 }. 

W5. [Flip L] Change the value of variable |Z|，and update r，ci … c n ，/o … /r-i， 
ki ... k m , wi ... w m to agree with this new value. (Exercise 302 explains 
how to implement steps W4 and W5 efficiently, with computer-friendly 
changes to the data structures.) Set t <— t + 1 and return to W2. | 

If，for example, we try to satisfy the seven clauses of ( 7 ) with Algorithm W， 
as we did earlier with Algorithm P，the choice ^ 1 ^ 23 : 3 X 4 = 0110 violates 234; 
and C 1 C 2 C 3 C 4 turns out to be 0110 in this situation. So step W4 will choose to 
flip X 4 ^ and we’ll have the solution 0111. (See exercise 303.) 

Notice that step W3 focuses attention on variables that need to change. 
Furthermore，a literal that appears in the most unsatisfied clauses is most likely 
to appear in the chosen clause Cj. 

If no cost-free flip is available, step W4 makes nongreedy choices with prob¬ 
ability p. This policy keeps the algorithm from getting stuck in an unsatisfiable 
region from which there’s no greedy exit. Extensive experiments by S. Seitz, 

M. Alava, and P. Orponen [J. Statistical Mechanics (June 2005), P06006:l-27] 
indicate that the best choice of p is .57 when large random 3SAT problems are 
being tackled. For example，with this setting of and with m = 4.2n random 
3-literal clauses, Algorithm W works fantastically well: It tends to find solutions 
after making fewer than 10,000n flips when n = 10 4 , and fewer than 2500n flips 
when 10 5 <n< 10 6 . 

What about the parameter N? Should we set it equal to 2n (as recom¬ 
mended for 3SAT problems with respect to Algorithm P)，or perhaps to n 2 (as 
recommended for 2SAT in exercise 299)，or to 2500n (as just mentioned for 3SAT 
in Algorithm W)，or to something else? When we use an algorithm like Walks AT， 
whose behavior can vary wildly depending on random choices and on unknown 
characteristics of the data, it’s often wise to “cut our losses” and to start afresh 
with a brand new pattern of random numbers. 

Exercise 306 proves that such an algorithm always has an optimum cutoff 
value N = N\ which minimizes the expected time to success when the algorithm 
is restarted after each failure. Sometimes TV* = oo is the best choice，meaning 
that we should always keep plowing ahead; in other cases N" is quite small. 

But TV* exists only in theory，and the theory requires perfect knowledge of 
the algorithm’s behavior. In practice we usually have little or no information 
about how N should best be specified. Fortunately there’s still an effective way 
to proceed, by using the notion of reluctant doubling introduced by M. Luby, 
A. Sinclair, and D. Zuckerman [Information Proc. Letters 47 (1993)，173—180]， 
who defined the interesting sequence 

1 , 1 , 2 , 1 , 1 , 2 ,4, 1 , 1 , 2 , 1 , 1 , 2 ,4, 8 , 1 , 1 , 2 , 1 , 1 , 2 ,4, 1 , 1 , 2 ,.... ( 130 ) 
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The elements of this sequence are all powers of 2. Furthermore we have 5 n+ i = 
2S n if the number S n has already occurred an even number of times, otherwise 
5 n+ i = 1. A convenient way to generate this sequence is to work with two 
integers (u ， v), and to start with = ( 1 ， 1 ); then 

(以 n+l ，幻 n+ 1 ) 二 (^n ^ — Un 二 幻 n? (Mn + 1， 1): i^n i 2 〜)) . (工3工) 

The successive pairs are (1,1), (2,1), (2,2), (3,1), (4,1), (4,2), (4,4), (5,1),, 
and we have S n = v n for all n > 1. 

The reluctant doubling strategy is to run Algorithm W repeatedly with 
N = cSi^ cS 2 ^ cSs^ …， until success is achieved, where c is some constant. 
Exercise 308 proves that the expected running time X obtained in this way 
exceeds the optimum by at most a factor of O(logX). Other sequences besides 
(S n ) also have this property，and they’re sometimes better (see exercise 311). 
The best policy is probably to use {cS n 、, where c represents our best guess 
about the value of TV*; in this way we hedge our bets in case c is too small. 

The Local Lemma. The existence of particular combinatorial patterns is often 
established by using a nonconstructive proof technique called the “probabilistic 
method，” pioneered by Paul Erdos. If we can show that Pr(X) > 0， in some 
probability space，then X must be true in at least one case. For example [Bull. 
Amer. Math. Soc. 53 (1947), 292-294], Erdos famously observed that there is a 
graph G on n vertices such that neither G nor G contains a A: - clique, whenever 

⑺ < 2 ^- 1 )/ 2 - 1 . ( 132 ) 


For if we consider a random graph each of whose ⑵ edges is present with 
probability 1/2, and if U is any particular subset of k vertices in the proba¬ 
bility that either G \ U or G 1is a complete graph is clearly 2/2 k( ^ k ~ 1 ^ 2 . Hence 
the probability that this doesn’t happen for any of the ⑵ subsets U is at least 

1 — This probability is positive; so such a graph must exist. 

The proof just given does not provide any explicit construction. But it does 
show that we can find such a graph by making at most 1/(1 — (^) 2 1_/c ^ _1 ^/ 2 ) 
random trials，on the average, provided that n and k are small enough that we 
are able to test all (^) subgraphs in a reasonable amount of time. 

Probability calculations of this kind are often complicated by dependencies 
between the random events being considered. For example, the presence of a 
clique in one part of a graph affects the likelihood of many other cliques that 
share some of the same vertices. But the interdependencies are often highly 
localized, so that “remote” events are essentially independent of each other. 
Laszlo Lovasz introduced an important way to deal with such situations early in 
the 1970s，and his approach has become known as the “Local Lemma” because it 
has been used to establish many theorems. First published as a lemma on pages 
616-617 of a longer paper [Erdos and Lovasz, Infinite and Finite Sets, Colloquia 
Math. Soc. Janos Bolyai 10 (1975)，609-627]，and subsequently extended to a 
“lopsided” form [P. Erdos and J. Spencer, Discrete Applied Math. 30 (1991), 
151—154]，it can be stated as follows: 
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Lemma L. Let _Ai ， •… ， A m be events in some probability space. Let G be a 
graph on vertices { 1 ， … ，爪 }，and let ( 仍， … ^Pm) be numbers such that 

Pr (烏 I fl • • • n A jk ) < Pi whenever k > 0 and , i ~t~h. (>33) 

Then Pr(^i fl • • • fl A m ) > 0 whenever (pi ， • • • ， p m ) lies in a certain set 7Z(G). | 

In applications we think of the Aj as “bad” events, which are undesirable 
conditions that interfere with whatever we’re trying to find. The graph G is 
called a “lopsidependency graph” for our application; this name was coined as 
an extension of Lovasz’s original term “dependency graph,” for which the strict 
condition c = was assumed in place oi c <pi in ( 133 ). 

The set 71(G) of probability bounds for which we can guarantee that all bad 
events can simultaneously be avoided，given ( 133 )，will be discussed further be¬ 
low. If G is the complete graph K m ^ so that ( 133 ) simply states that Pr(^) < 队， 
尺 (G) is clearly {(pi,... ， p m ) | (Pi ， ... ， Pm) > ( 0 ,..., 0 ) and pi + • • • + p m < 1 }； 
this is the smallest possible 71(G). At the other extreme, if G is the empty 
graph K m , we get {(pi ， … ， p m ) | 0 < pj < 1 for 1 < j < m}，the largest possible 
7Z(G). Adding an edge to G makes 1Z(G) smaller. Notice that, if (pi ， … ^p m ) is 
in 1Z(G) and 0 < < pj for 1 < j < m, then also (pi，• • • G 1Z(G). 

Lovasz discovered an elegant local condition that suffices to make Lemma L 

widely applicable [see J. Spencer, Discrete Math. 20 (1977), 69-76]: 

Theorem L. The probability vector (pi ， … is in 71(G) when there are 
numbers 0 < ... ^0 m < 1 such that 

Pi = 0i (l-dj). ( 134 ) 

%— j in G 

Proof. Exercise 344(e) proves that Pr(Ai fl... fl A m ) > (1 — — 6 m ). | 

James B. Shearer [Combinatorica 5 (1985) ， 241-245] went on to determine 
the exact maximum extent of 71(G) for all graphs as we’ll see later; and he 
also established the following important special case: 

Theorem J. Suppose every vertex of G has degree < where d > 1. Then 
(p,..., p) G 1Z(G) when p < (d — l) d ~ l /d d . 

Proof. See the interesting inductive argument in exercise 317. | 

This condition on p holds whenever p < l/(ed) (see exercise 319). 

Further study led to a big surprise: The Local Lemma proves only that 
desirable combinatorial patterns exist, although they might be rare. But Robin 
Moser and Gabor Tardos discovered [JACM 57 (2010), 11:1-11:15] that we can 

efficiently compute a pattern that avoids all of the bad using an almost 
unbelievably simple algorithm analogous to Walks AT! 

Algorithm M (Local resampling). Given m events {_Ai ， … , A m } that depend 
on n Boolean variables {$i ， … ，： r n }，this algorithm either finds a vector x\ .. .x n 
for which none of the events is true，or loops forever. We assume that Aj is a 
function of the variables {xk \ k ^ Ej} for some given subset Ej C {1 ， •… ， n}. 
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Whenever the algorithm assigns a value to 抑 ， it sets Xk ^ I with probability 
and Xk ^ 0 with probability 1 — where ^ is another given parameter. 

Ml. [Initialize.] For 1 < k < set Xk ^ [U < 心 ]， where U is uniform in [0 .. 1). 

M2. [Choose j] Set j to the index of any event such that Aj is true. If no such j 
exists，terminate successfully，having found a solution x\ • • .x n . 

M3. [Resample for Aj] For each k G 3j, set Xk [U <^k]^ where U is uniform 
in [0 .. 1). Return to M2. | 

(We have stated Algorithm M in terms of binary variables Xk purely for conve¬ 
nience. The same ideas apply when each Xk has a discrete probability distribution 
on any set of values，possibly different for each k.) 

To tie this algorithm to the Local Lemma，we assume that event Ai holds 
with probability < pi whenever the variables it depends on have the given 
distribution. For example, if Ai is the event u xs / then pi must be at 

least 心 (1 - 《 5 ) + (1 - 《 3 ) 《 5 . 

We also assume that there’s a graph G on vertices {1 ， … ， m} such that 
condition ( 133 ) is true, and that i —— j whenever i / j and D Ej / 0. Then 
G is a suitable dependency graph for {^ ， … ， A m }，because the events Aj 1 ^ 

• • • ， Aj h can’t possibly influence Ai when i + ji, •. ” i + jk. (Those events 
share no common variables with Ai.) We can also sometimes get by with fewer 
edges by making G a lopsidependency graph; see exercise 351. 

Algorithm M might succeed with any given events，purely by chance. But 
if the conditions of the Local Lemma are satisfied, success can be guaranteed: 

Theorem M. If ( 133 ) holds with probabilities that satisfy condition ( 134 ) of 
Theorem step M3 is performed for Aj at most 0j/(l — 0j) times^ on average. 

Proof. Exercise 352 shows that this result is a corollary of the more general 
analysis that is carried out below. The stated upper bound is good news, because 
0j is usually quite small. | 

Traces and pieces. The best way to understand why Algorithm M is so efficient 
is to view it algebraically in terms of “traces •” The theory of traces is a beautiful 
area of mathematics in which amazingly simple proofs of profound results have 
been discovered. Its basic ideas were first formulated by P. Cartier and D. Foata 
[Lecture Notes in Math. 85 (1969)]，then independently developed from another 
point of view by R. M. Keller [JACM 20 (1973), 514-537, 696-710] and A. 

Mazurkiewicz [“Concurrent program schemes and their interpretations，” DAIMI 
Report PB 78 (Aarhus University，July 1977)]. Significant advances were made 
by G. X. Viennot [Lecture Notes in Math. 1234 (1985), 321-350], who presented 

many wide-ranging applications and explained how the theory could readily be 
visualized in terms of what he called “heaps of pieces •” 

Trace theory is the study of algebraic products whose variables are not 
necessarily commutative. Thus it forms a bridge between the study of strings 
(in which, for example, acbbaca is quite distinct from baccaab) and the study 
of ordinary commutative algebra (in which both of those examples are equal to 
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aaabbcc = a 3 b 2 c 2 ). Each adjacent pair of letters {a, 6 } either commutes, meaning 
that ab = ba，or clashes, meaning that ab is different from ba. If，for instance ， 
we specify that a commutes with c but that b clashes with both a and c，then 
acbbaca is equal to cabbaac, and it has six variants altogether; similarly, there 
are ten equally good ways to write baccaab. 

Formally speaking，a trace is an equivalence class of strings that can be 
converted to each other by repeatedly interchanging pairs of adjacent letters that 
don’t clash. But we don’t need to fuss about the fact that equivalence classes 
are present; we can simply represent a trace by any one of its equivalent strings ， 
just as we don’t distinguish between equivalent fractions such as 1/2 and 3/6. 

Every graph whose vertices represent distinct letters defines a family of 
traces on those letters, when we stipulate that two letters clash if and only 
if they are adjacent in the graph. For example，the path graph a —— b —— c 
corresponds to the rules stated above. The distinct traces for this graph are 


e ， a ， 6 , c ， aa ， ab, ac ， ha, bb, be, c 6 , cc ， aaa ， aab, … ， ccb, ccc ， aaaa， … ( 135 ) 


if we list them first by size and then in lexicographic order. (Notice that ca 
is absent, because ac has already appeared.) The complete graph K n defines 
traces that are the same as strings，when nothing commutes; the empty graph 
K n defines traces that are the same as monomials, when everything commutes. If 
we use the path a —— b —— c —— d —— e —— f to define clashes, the traces bcebafdc 
and efbcdbca turn out to be the same. 

Viennot observed that partial commutativity is actually a familiar concept, 
if we regard the letters as “pieces” that occupy “territory •” Pieces clash if and 
only if their territories overlap; pieces commute if and only if their territories are 
disjoint. A trace corresponds to stacking the pieces on top of one another，from 
left to right, letting each new piece “fall” until it either rests on the ground or on 
another piece. In the latter case, it must rest on the most recent piece with which 
it clashes. He called this configuration an empilement —— a nice French word. 

More precisely，each piece a is assigned a nonempty subset T(a) of some 
universe，and we say that a clashes with b if and only if T(a) D T(b) / 0. For 
example, the constraints of the graph a —— b —— c —— d —— e —— f arise when we let 

T(a) = {l ， 2}, T( 6 ) = {2,3}, T(c) = {3,4 }， ...， T(/) = {6,7} ; 


then the traces bcebafdc and efbcdbca both have 



(136) 


as their empilement. (Readers who have played the game of Tetris ⑧ will imme¬ 
diately understand how such diagrams are formed，although the pieces in trace 
theory differ from those of Tetris because they occupy only a single horizontal 
level. Furthermore, each type of piece always falls in exactly the same place; and 
a piece’s territory T(a) might have “holes” 一 it needn’t be connected.) 
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Two traces are the same if and only if they have the same empilement. In 
fact, the diagram implicitly defines a partial ordering on the pieces that appear; 
and the number of different strings that represent any given trace is the number 
of ways to sort that ordering topologically (see exercise 324). 

Every trace a has a length, denoted by |a|，which is the number of letters in 
any of its equivalent strings. It also has a height, written h(a), which is the num¬ 
ber of levels in its empilement. For example ， \bcebafdc\ = 8 and h(bcebafdc) = 4. 


Arithmetic on traces. To multiply traces，we simply concatenate them. If, 
for example, a = bcebafdc is the trace corresponding to ( 136 )，then aa R = 
beebafdccdfabecb has the following empilement: 



The algorithm in exercise 327 formulates this procedure precisely. A moment’s 
thought shows that |a/?| = |a| + |/?|， h{a(3) < h(a) + h(/3) ， and h(aa R ) = 2h(a). 

Traces can also be divided，in the sense that a = (aP)/P can be determined 
uniquely when and (3 are given. All we have to do is remove the pieces of (3 
from the pieces of a/3, one by one，working our way down from the top of the 
empilements. Similarly, the value of ^ = a \ (ap) can be computed from the 
traces a and a/3. (See exercises 328 and 329.) 

Notice that we could rotate diagrams like ( 136 ) and ( 137 ) by 90 degrees ， 
thereby letting the pieces “fall” to the left instead of downwards. (We’ve used a 
left-to-right approach for similar purposes in Section 5.3.4, Fig. 50.) Or we could 
let them fall upwards, or to the right. Different orientations are sometimes more 
natural, depending on what we’re trying to do. 

We can also add and subtract traces, thereby obtaining polynomials in vari¬ 
ables that are only partially commutative. Such polynomials can be multiplied 
in the normal way; for example, (a + ^)(7 — 8) = — a8 + /3j — /38. Indeed, 

we can even work with infinite sums，at least formally: The generating function 
for all traces that belong to the graph a —— b —— c is 

l + a +6 + c+aa + a 6 +ac+ 6 a +66 + 6 c+c 6 +cc+aaa+- • • + ccc+aaaa+-" • ( 138 ) 

(Compare with ( 135 ); we now use 1， not e，to stand for the empty string.) 

The infinite sum ( 138 ) can actually be expressed in closed form: It equals 


1 


1 + (a+ 6 +c—ac) + (a+ 6 +c—ac ) 2 + 


1 — a —b — c + ac 


# • # 


039 ) 


topological sorting 
length of a trace 
height of a trace 
multiplication of traces 
division of traces 
right division of traces 
polynomials 
generating function 
empty string 
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an identity that is correct not only when the variables are commutative, but also 
in the algebra of traces, when variables commute only when they don’t clash. 

In their original monograph of 1969, Cartier and Foata showed that the sum 
of all traces with respect to any graph can be expressed in a remarkably simple 
way that generalizes ( 139 ). Let’s define the Mdbius function of a trace a with 
respect to a graph G by the rule 





(-ir 


if ho{(y) > 1 ； 

otherwise. 


( 140 ) 


(The classical Mdbius function fi(n) for integers，defined in exercise 4.5.2— 10, is 
analogous.) Then the Mdbius series for G is defined to be 


M G = E//G(a)a, ( 141 ) 

a 

where the sum is over all traces. This sum is a polynomial, when G is finite ， 
because it contains exactly one nonzero term for every independent set of vertices 
in G] therefore we might call it the Mdbius polynomial For example, when G is 
the path a —— b —— c，we have Mq — 1 — a — 6 — c + ac，the denominator in ( 139 ) • 
Cartier and Foata’s generalization of ( 139 ) has a remarkably simple proof: 

Theorem F. The generating function Tq for the sum of all traces ，with respect 
to any graph G, is 1 /Mg- 


Proof. We want to show that MqTq = 1， in the (partially commutative) algebra 
of traces. This infinite product is ^ ^ #G(oO [7 = Hence 

we want to show that the sum of //g(qO, over all ways to factorize 7 = as the 
product of two traces a and ^ is zero whenever 7 is nonempty. 

But that’s easy. We can assume that the letters are ordered in some arbitrary 
fashion. Let a be the smallest letter in the bottom level of 7 ’s empilement. We 
can restrict attention to cases where a consists of independent (commuting) let¬ 
ters (pieces)，because ⑽⑹ = 0 otherwise. Now if a = aa’ for some trace a’，let 
/3 1 = otherwise we must have (3 = a/3 1 for some trace 0’， and we let a 1 = aa. 
In both cases a/3 = a’0’ ， (a f y = a ， ( y 5 / ) / = 0， and fio («) + I^g ) = 0. So we’ve 
grouped all possible factorizations of 7 into pairs that cancel out in the sum. | 

The Mdbius series for any graph can be computed recursively via the formula 

M G = M G \ a - aM G \ a ,, a* = {a} U{b\ a — 6 }, ( 142 ) 

where a is any letter (vertex) of because we have a ^ / or a G / whenever / 
is independent. For example, if G is the path a —— b —— c —— d —— e —— /， then 
G\a* = I {c ， d ， e，/} is the path c —— d —— e —— /; repeated use of ( 142 ) yields 


Mq = 1 — a — b — c — d — e — f + ac + ad + ae + af 

+ bd + be + bf + ce + cf + df — ace — acf — adf — bdf ( 143 ) 


in this case. Since Mq is a polynomial, we can indicate its dependence on the 
variables by writing Mc?(a ， 6 , c ， d ， e，/)• Notice that Mq is always multilinear 
(this is，linear in each variable); and M G \ a (b ， c ， d ， e, f) = Mg( 0 , 6 , c ， d ， e，/)• 


Cartier 

Foata 

Mobius function of a trace 
Mobius function 
Mobius series 
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In applications we often want to replace each letter in the polynomial by 
a single variable，such as 2 ： ， and write Mq(z). The polynomial in ( 143 ) then 
becomes Mq(z) = l — 6z + 10z 2 —4 ： z 3 ] and we can conclude from Theorem F that 
the number of traces of length n with respect to G is [z n ] 1/(1 — 6: + 10 : 2 — 4: 3 )= 

;(2 + \/ 2) n+2 + K 2 - \/ 2)^ 2 - 2 n+1 . 

Although ( 142 ) is a simple recurrence for M。，we can’t conclude that Mq 
is easy to compute when G is a large and complicated graph. Indeed, the degree 
of Mq is the size of a maximum independent set in G] and it’s NP-hard to 
determine that number! On the other hand, there are many classes of graphs ， 
such as interval graphs and forests，for which Mq can be computed in linear time. 

If a is any trace，the letters that can occur first in a string that represents 
it are called the sources of a; these are the pieces on the bottom level of a’s 
empilement，also called its minimal pieces. Dually，the letters that can occur 
last are the sinks of a, its maximal pieces. A trace that has only one source is 
called a cone] in this case all pieces are ultimately supported by a single piece 
at the bottom. A trace that has only one sink is ， similarly，called a pyramid. 
Viennot proved a nice generalization of Theorem F in his lecture notes: 


Mq\a/ Mq is the sum of all traces whose sources are contained in A. ( 144 ) 

(See exercise 338; Theorem F is the special case where A is the set of all vertices.) 
In particular, the cones for which a is the only source are generated by 


G\a/M G 


a M G \ a * / 


045) 


* Traces and the Local Lemma. Now we’re ready to see why the theory of 
traces is intimately connected with the Local Lemma. If G is any graph on 
the vertices {1 ， … ， m}，we say that 1Z(G) is the set of all nonnegative vectors 
(Pi ， • • • ， Pm) such that Mq( p[ ，•… Q whenever 0 < pj < pj for 1 < j < m. 
This definition of 71(G) is consistent with the implicit definition already given 
in Lemma L，because of the following characterization found by J. B. Shearer: 

Theorem S. Under condition (133) of Lemma ( 仍， … ^p m ) G 1Z(G) implies 


'Pr(A 1 n ... n A m ) > . ,p m ) > 0. ( 146 ) 

Conversely，if (pi ， … ,Pm) ♦ 尺 (G )，there are events … ， B m such that 
Pr(Bi I Bj 1 fl • • • fl Bj k ) = pi whenever ^ > 0 and i+ j\, … ， i-/-jk^ ( 147 ) 
and Pr(Bi 门 ... fl B m ) = 0. 

Proof. When {pi, ••” p m ) ^ exercise 344 proves that there’s a unique 

distribution for events ， B m such that they satisfy ( 147 ) and also 



jeJ 


> = M G (Pi[l e J] ， ... ， p m [me J]) 

托 J 


( 148 ) 


for every subset J C {1 ， •… ， m}. In this “extreme” worst-possible distribution ， 
Pr(Bi fl Bj) = 0 whenever i —— j in G. Exercise 345 proves the converse. | 


maximum independent set 

NP-hard 
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sinks 
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Given a probability vector (pi, … ， : Pm), let 


^g( z ) = m g(Piz ， … ， p m z). 


(i49) 


Theorem F tells us that the coefficient of z n in the power series 1/Mq(z) is the 
sum of all traces of length n for G. Since this coefficient is nonnegative，we know 
by Pringsheim’s theorem (see exercise 348) that the power series converges for 


all 2 ： < 1 + 5, where 1 + 5 is the smallest real root of the polynomial equation 
Mq(z) = 0 ; this number 8 is called the slack of {pi ， Pm) with respect to G. 

It’s easy to see that (pi ， … ^p m ) G 71(G) if and only if the slack is positive. 
For if 5 < 0 , the probabilities (p [,... ,0 with pj = (1 + S)pj make Mq — 0 . 
But if 6 > 0 , the power series converges when z = 1. And (since it represents 


the sum of all traces) it also converges to the positive number 1 /Mq if any pj is 
decreased; hence ( 仍， … ， : p m ) lies in 1Z(G) by definition. Indeed，this argument 
shows that, when (pi， … G 1 Z(G )，we can actually increase the probabilities 
to ((1 + e)pi，•••，（!_ + e)p m )，and they will still lie in 1Z(G) whenever e < 6 . 


Let’s return now to Algorithm M. Suppose the successive bad events Aj 
that step M3 tries to quench are Xi ， … ， X/v，where N is the total number 
of times step M3 is performed (possibly N = 00 ) • To prove that Algorithm M is 
efficient，we shall show that this random variable N has a small expected value ， 
in the probability space of the independent uniform deviates U that appear in 
steps Ml and M3. The main idea is that XiX 2 ... Xn is essentially a trace for 
the underlying graph; hence we can consider it as an empilement of pieces. 

Some simple and concrete examples will help to develop our intuition; we 
shall consider two case studies. In both cases there are m = 6 events C ^ 

D, E, F, and there are n = 7 variables x\ .. .X 7 . Each variable is a random bit; 
thus = • • • = ^7 = 1/2 in the algorithm. Event A depends on X\X 2 ^ while B 
depends on 奶 ❿，…， and F depends on xqx^. Furthermore, each event occurs 
with probability 1/4. In Case 1 ， each event is true when its substring is c 10 ’； thus 
all events are false if and only if xi .. .X 7 is sorted — that is^i < ^2 < • • • < xj. 
In Case 2, each event is true when its substring is C 1V; thus all events are false 
if and only if xi .. .X 7 has no two consecutive Is. 

What happens when we apply Algorithm M to those two cases? One 
possible scenario is that step M3 is applied N = 8 times，with XiX 2 ... X 8 = 
BCEBAFDC . The actual changes to the bits x\ .. .xj might then be 

Case 1 Case 2 


Pringsheim 

slack 

consecutive Is 



(Read xi .. .X 7 from top to bottom in these diagrams，and scan from left to right. 
Each module c \ ^ means “replace the two bad bits at the left by two random bits 
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at the right.” In examples such as this，any valid solution xi .. .X 7 can be placed 
at the far right; all values to the left of the modules are then forced.) 

Notice that these diagrams are like the empilement ( 136 )，except that they’ve 
been rotated 90°. We know from ( 136 ) that the same diagram applies to the 
scenario EFBCDBCA as well as to BCEBAFDC, because they’re the same, as 
traces. Well • • • ， not quite! In truth, EFBCDBCA doesn’t give exactly the same 
result as BCEBAFDC in Algorithm M, if we execute that algorithm as presently 
written. But the results would be identical if we used separate streams of 
independent random numbers Uk for each variable Xk. Thus we can legitimately 
equate equivalent traces, in the probability space of our random events. 

The algorithm runs much faster in practice when it’s applied to Case 1 than 
when it’s applied to Case 2. How can that be? Both of the diagrams in ( 150 ) 
occur with the same probability, namely (l/2) 7 (l/4) 8 , as far as the random num¬ 
bers are concerned. And every diagram for Case 1 has a corresponding diagram 
for Case 2; so we can’t distinguish the cases by the number of different diagrams. 
The real difference comes from the fact that, in Case 1， we never have two events 
to choose from in step M2, unless they are disjoint and can be handled in either 
order. In Case 2, by contrast, we are deluged at almost every step with events 
that need to be snuffed out. Therefore the scenario at the right of ( 150 ) is actually 
quite unlikely; why should the algorithm pick B as the first event to correct, and 
then (7, rather than A? Whatever method is used in step M2, we’ll find that 
the diagrams for Case 2 will occur less frequently than dictated by the strict 
probabilities, because of the decreasing likelihood that any particular event will 
be worked on next, in the presence of competing choices. (See also exercise 353.) 

Worst-case upper bounds on the running time of Algorithm M therefore 
come from situations like Case 1. In general, the empilement BCEBAFDC 
in ( 150 ) will occur in a run of Algorithm M with probability at most bcebafdc, 
if we write for the probabilistic upper bound for event A that is denoted by 
Lf Pi in ( 133 ) when A is 為 ， and if c b\ •••，‘/’ are similar for … ， F. The 
reason is that bcebafdc is clearly the probability that those events are produced 
by the independent random variables Xk set by the algorithm, if the layers of 
the corresponding empilement are defined by dependencies between the variable 
sets Ej. And even if events in the same layer are dependent (by shared variables) 
yet not lopsidependent (in the sense of exercise 351)，such events are positively 
correlated; so the FKG inequality of exercise MPR— 61， which holds for the 
Bernoulli-distributed variables of Algorithm M，shows that bcebafdc is an upper 
bound. Furthermore the probability that step M2 actually chooses B ， C ， E ， B, 
D, and C to work on is at most 1. 

Therefore，when (pi，• • • G 11(0 )，Algorithm M’s running time is max¬ 
imized when it is applied to events …， B m that have the extreme distri¬ 
bution ( 148 ) of exercise 344. And we can actually write down the generating 
function for the running time with respect to those extreme events: We have 


^ Pr(Algorithm M on 氏 ，… ， B m does N resamplings) 2 ：^ = 

N>0 


够) 

mW) 


051) 


handwaving 
FKG inequality 
Bernoulli-distributed 
extreme distribution 
generating function 
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where Mq(z) is defined in ( 149 )，because the coefficient of z N in 1/Mq(z) is the 
sum of the probabilities of all the traces of length N • Theorem F describes the 
meaning of 1 /Mq{ 1 ) as a “formal” power series in the variables we proved 
it without considering whether or not the infinite sum converges when those 
variables receive numerical values. But when (pi, … ^p m ) G 1Z(G), this series is 
indeed convergent (it even has a positive “slack”）. 

This reasoning leads to the following theorem of K. Kolipaka and M. Szegedy 

[STOC 43 (2011), 235-243] : 

Theorem K. If (px” • • ^p m ) G 7^((7) ? Algorithm M resamples Ej at most 

Ej = PjM G \ A , {pi,... ,Pm)/M G (p 1 ,... ,p m ) ( 152 ) 

J 

times，on the average. In particular，the expected number of iterations of step M3 
is at most 五 1 + • • • + E m < m! 8, where 6 is the slack ， p m ). 


Proof. The extreme distribution •… ， B m maximizes the number of times Ej 
is resampled，and the generating function for that number in the extreme case is 


• • • ，巧 —1 ， Pj ， Pjf+l ， • • • •) Pm) 
-^G (Pi ， • • • ， Pj—1，Pj 之， Pi+1 ， Pm) 


053) 


Differentiating with respect to 2 ：, then setting z <— 1^ gives ( 152 ), because the 
derivative of the denominator is —pjMG\A*{pi ， … ^Pm) by ( 141 ). 

3 

The stated upper bound on E\ H - + E m is proved in exercise 355. | 

^Message passing. Physicists who study statistical mechanics have developed 
a significantly different way to apply randomization to satisfiability problems ， 
based on their experience with the behavior of large systems of interacting 
particles. From their perspective, a set of Boolean variables whose values are 
0 or 1 is best viewed as an ensemble of particles that have positive or negative 
“spin” ； these particles affect each other and change their spins according to local 
attractions and repulsions, analogous to laws of magnetism. A satisfiability 
problem can be formulated as a joint probability distribution on spins for which 
the states of minimum “energy” are achieved precisely when the spins satisfy as 
many clauses as possible. 

In essence, their approach amounts to considering a bipartite structure in 
which each variable is connected to one or more clauses, and each clause is 
connected to one or more variables. We can regard both variables and clauses 
as active agents, who continually tweet to their neighbors in this social network. 
A variable might inform its clauses that “I think I should probably be true”; 
but several of those clauses might reply, U I really wish you were false.^ By 
carefully balancing these messages against each other, such local interactions 
can propagate and build up more and more knowledge of distant connections, 
often converging to a state where the whole network is reasonably happy. 

A particular message-passing strategy called survey propagation [A. Braun- 
stein ， M. Mezard, and R. Zecchina ，Random Structures & Algorithms 27 (2005 )， 
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201-226] has proved to be astonishingly good at solving random satisfiability 
problems in the “hard” region just before the threshold of unsatisfiability. 

Let (7 be a clause and let l be one of its literals. A “survey message” rjc-^i is 
a fraction between 0 and 1 that represents how urgently C wants l to be true. If 
rjc-^i = the truth of l is desperately needed, lest C be false; but if rjc-^i = 0 5 
clause C isn’t the least bit worried about the value of variable |/|. Initially we 
set each rjc-^i to a completely random fraction. 

We shall consider an extension of the original survey propagation method 
[see J. Chavas, C. Furtlehner ， M. Mezard，and R. Zecchina, J. Statistical Me¬ 
chanics (November 2005) ， P11016:l-25; A. Braunstein and R. Zecchina, Physical 
Review Letters 96 (27 January 2006), 030201:1-4], which introduces additional 
^reinforcement messages” rji for each literal l. These new messages, which are 
initially all zero, represent an external force that acts on l. They help to focus 
the network activity by reinforcing decisions that have turned out to be fruitful. 

Suppose v is a variable that appears in just three clauses: positively in A 
and B, negatively in C. This variable will respond to its incoming messages 

%， and % by computing two “flexibility coefficients^ 7 r v 
and 〜 ， using the following formulas: 

〜 = (1 -%)(1 -^ 4 七 )(1 = (1 -%)(1 

If, for instance, V v = = 0 while r] A _^ v = r] B ^ v = 7] c _^' = 2/3, then tt v = 

1/9^ 7 T_ = 1/3. The 7r’s are essentially dual to the rj’s ， because high urgency 
corresponds to low flexibility and vice versa. The general formula for each literal l 
is 

兀 i = (i - 切 ） n (! ( 154 ) 

lec 

Survey propagation uses these coefficients to estimate variable tendency 
to be either 1 (true)，0 (false)，or * (wild )， by computing three numbers 

(1—(1—71^)71^ / 、 

P= — : - ， Q = --- ， r = --- ； (155) 


then p + g + r = 1， and (p ， g ， r) is called the “field” of r，representing re¬ 


spectively (truth, falsity ， wildness). The 
field turns out to be (8/11,2/11,1/11) 
in our example above, indicating that v 
should probably be assigned the value 1. 
But if 7]a-^v and t}b-^ v had been only 
1/3 instead of 2/3, the field would have 
been (5/17, 8/17, 4/17)，and we would 
probably want v = 0 in order to sat¬ 
isfy clause C. Figure 51 shows lines of 
constant p — g as a function of and 
7 r^; the most decisive cases (\p — g| ^ 1) 
occur at the lower right and upper left. 


l 




0 

Fig. 51. Lines of constant 
bias in a variable’s “field •” 
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If = 丌 。 = 0 ， there 5 s no flexibility at all: Variable v is being asked to 
be both true and false. The field is undefined in such cases, and the survey 
propagation method hopes that this doesn’t happen. 

After each literal l has computed its flexibility, the clauses that involve l or l 
can use n l and ttj to refine their survey messages. Suppose, for example, that C 
is the clause u\/ v W w. It will replace the former messages r\ c — u ， Vc-^w 

by 

^lC-^u = 5 ^lC-^v ~ ^u-^C^w^-C 5 VC—^w ~ ^u^C^v-^C •> 

where each is a u bias message” received from literal 


= 


(1 -7T r )7T"(l 

丌『 + ( 1 一 71 >)丌/八1一呢~^) 


(邮) 


reflecting Vs propensity to be false in clauses other than C. In general we have 


Vc-^i 


vec 


li-^c 


057) 


(Appropriate conventions must be used to avoid division by zero in formulas 
( 156 ) and ( 157 ); see exercise 359.) 

New reinforcement messages can also be computed periodically, using the 
formula 

% = : n (巧 8 ) 

a + 丌 

for each literal l] here x — y denotes max(x — y ， 0 )，and k is a reinforcement 
parameter specified by the algorithm. Notice that > 0 only if = 0. 

For example, here are messages that might be passed when we want to satisfy 
the seven clauses of ( 7 ): 

h h h Vc-^h Vc-^h Vc-^h li 2 -^c 

1 2 3 0 0 0 3/5 0 0 

1 2 3 1/5 0 0 0 3/5 1/3 

2 3 4 1/5 0 0 0 1/3 3/5 

2 3 4 0 0 0 3/5 0 0 

1 3 4 0 0 1/5 3/5 1/3 0 

1 3 4 0 0 0 0 0 3/5 

1 2 4 0 0 0 0 0 0 



(Recall that the only solutions to these clauses are 1234 and 1234.) In this 
case the reader may verify that the messages of ( 159 ) constitute a “fixed point”: 
The rj messages determine the 7 r’s; conversely, we also have — h for all 
clauses C and all literals if the reinforcement messages 切 remain constant. 

Exercise 361 proves that every solution to a satisfiable set of clauses yields a 
fixed point of the simultaneous equations ( 154 ) ，（ 156 ) ， ( 157 ), with the property 
that 7]i = [l is true in the solution]. 
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Experiments with this message-passing strategy have shown, however, that 
the best results are obtained by using it only for preliminary screening, with the 
goal of discovering variables whose settings are most critical; we needn’t continue 
to transmit messages until every clause is fully satisfied. Once we’ve assigned 
suitable values to the most delicate variables, we’re usually left with a residual 
problem that can readily be solved by other algorithms such as Walks AT. 

The survey, reinforcement, and bias messages can be exchanged using a wide 
variety of different protocols. The following procedure incorporates two ideas 
from an implementation prepared by C. Baldassi in 2012: (1) The reinforcement 
strength k begins at zero, but approaches 1 exponentially. (2) Variables are rated 
1 ， 0, or 氺 after each reinforcement, according as max(p 5 g ， r) in their current field 
is g，or r. If every clause then has at least one literal that is true or *，message 
passing will cease even though some surveys might still be fluctuating. 

Algorithm S (Survey propagation). Given m nonempty clauses on n variables, 
this algorithm tries to assign values to most of the variables in such a way that 
the still - 皿 satisfied clauses will be relatively easy to satisfy. It maintains arrays 
7T/ and r]i of floating-point numbers for each literal as well as r}c-M for each 
clause C and each l e C. It has a variety of parameters: p (the damping factor 
for reinforcement) ， Nq and N (the minimum and maximum iteration limits )， 
e (the tolerance for convergence)，and ^ (the confidence level). 

51. [Initialize.] Set ^ 7T/ 4- 0 for all literals and rjc-^i U for all clauses C 

and l G C^ where U is uniformly random in [0 • • 1). Also set i 4- 0, 卜 1. 

52. [Done?] Terminate unsuccessfully if i > If i is even or < < iV 0 , go to S5. 

53. [Reinforce.] Set (j) i- pcj) and k i- 1 — (/). Replace 7] t by for all literals 
using (158); but terminate unsuccessfully if 丌 ； = 7Tj = 0. 

54. [Test pseudo-satisfiability.] Go to S5 if there is at least one clause whose 
literals l all appear to be false，in the sense that tij < / k 1 and itj < \ (see 
exercise 358). Otherwise go happily to S8. 

55. [Compute the 7r’s.] Compute each 丌 “ using (154); see also exercise 359. 

56. [Update the surveys.] Set 6^0. For all clauses C and literals l G C^ com- 

pute rih using (157), and set S max((5, - Vc-^i vh. 

57. [Loop on i] If 5 > e, set i i + 1 and return to S2. 

58. [Reduce the problem.] Assign a value to each variable whose field satisfies 
\p — q\ > (Exercise 362 has further details.) | 

Computational experience — otherwise known as trial and error — suggests 
suitable parameter values. The defaults p — .995, 7V 0 = 5, TV = 1000, e = .01 ， 
and ^ = .50 seem to provide a decent starting point for problems of modest 
size. They worked well, for instance, when the author first tried a random 3SAT 
problem with 42,000 clauses and 10,000 variables: These clauses were pseudo - 
satisfiable when i = 143 (although 6 ^ .43 was still rather large); then step S8 
fixed the values of 8,282 variables with highly biased fields，and unit propagation 
gave values to 57 variables more. This process needed only about 218 megamems 
of calculation. The reduced problem had 1526 2-clauses and 196 3-clauses on 
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1464 variables (because many other variables were no longer needed); 626 steps 
of Walks AT polished it off after an additional 42 kilomems. By contrast, when 
Walks AT was presented with the original problem (using p = .57), it needed 
more than 31 million steps to find a solution after 3.4 gigamems of computation. 

Similarly，the author’s first experience applying survey propagation to a 
random 3SAT problem on n = 10 6 variables with m = 4.2n clauses was a 
smashing success: More than 800,000 variables were eliminated after only 32.8 
gigamems of computation, and Walks AT solved the residual clauses after 8.5 
megamems more. By contrast, pure Walks AT needed 237 gigamems to perform 
2.1 billion steps. 

A million - variable problem with 4,250,000 clauses proved to be more chal¬ 
lenging. These additional 50,000 clauses put the problem well beyond Walks AT ^ 
capability; and Algorithm S failed too, with its default parameters. However, 
the settings p = .9999 and iV 0 = 9 slowed the reinforcement down satisfactorily 5 
and produced some instructive behavior. Consider the matrix 


/ 3988 

3651 

3071 

2339 

1741 

1338 

946 

702 

508 

329 \ 

5649 

5408 

4304 

3349 

2541 

2052 

1448 

1050 
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510 

8497 

7965 

6386 

4918 

3897 

3012 

2248 

1508 

1075 

718 

11807 

11005 

8812 

7019 

5328 

4135 

3117 

2171 

1475 

1063 

15814 

14789 

11726 

9134 

7188 

5425 

4121 

3024 

2039 

1372 

20437 

19342 

15604 

12183 

9397 

7263 

5165 

3791 

2603 

1781 

26455 

24545 

19917 

15807 

12043 

9161 

6820 

5019 

3381 

2263 

33203 

31153 

25052 

19644 

15587 

11802 

8865 

6309 

4417 

2919 

39962 

38097 

31060 

24826 

18943 

14707 

10993 

7924 

5225 

3637 

V40731 

40426 

32716 

26561 

20557 

15739 

11634 

8327 

5591 

4035/ 


which shows the distribution of 7 r^ versus 7T V (see Fig. 51); for example ， ‘3988’ at 
the upper left means that 3988 of the million variables had 7 r^ between 0.0 and 0.1 
and 7T V between 0.9 and 1.0. This distribution, which appeared after 6 had been 
reduced to ^ 0.0098 by 110 iterations，is terrible — very few variables are biased 
in a meaningful way. Therefore another run was made with e reduced to .001; 
but that failed to converge after 1000 iterations. Finally, with e = .001 and 
N = 2000, pseudo-satisfaction occurred at i = 1373, with the nice distribution 


/406678 

1946 

1045 

979 

842 

714 

687 

803 

1298 

167649 \ 

338 

2 

2 

3 

0 

3 

1 

4 

2 

1289 

156 

1 

0 

0 

0 

1 

0 

2 

1 

875 

118 

4 

0 

0 

0 

0 

0 

0 

1 

743 

99 

0 

0 

0 

0 

0 

0 

1 

0 

663 

62 

0 

0 

0 

0 

0 

1 

0 

3 

810 

41 

0 

0 

0 

0 

0 

0 

0 

0 

1015 

55 

0 

0 

0 1 

0 

1 

1 

0 

1139 

63 

0 

0 

1 0 

0 

0 

1 

2 

1949 

\ 116 

61 

72 

41 

61 

103 

120 

162 

327 

406839 / 


(although S was now ^ 1!). The biases were now pronounced，yet not entirely 
reliable; the ^ parameter had to be raised，in order to avoid a contradiction when 
propagating unit literals in the reduced problem. Finally，with = .99, more 
than 800,000 variables could be set successfully. A solution was obtained after 
210 gigamems (including 21 megamems for Walks AT to finish the job，but not 
including the time spent learning how to set the parameters for so many clauses). 


author 
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Success with Algorithm S isn’t guaranteed. But hey, when it works, it’s 
sometimes the only known way to solve a particularly tough problem. 

Survey propagation may be viewed as an extension of the “belief propaga¬ 
tion^ messages used in the study of Bayesian networks [see J. Pearl ，Probabilistic 
Reasoning in Intelligent Systems (1988)，Chapter 4]; it essentially goes beyond 
Boolean logic on {0,1} to a three-valued logic on {0, 1， *}• Analogous message¬ 
passing heuristics had actually been considered much earlier by EL A. Bethe 

and R. E. Peierls [Proc. Royal Society of London A150 (1935), 552-575], and 
independently by R. G. Gallager [IRE Transactions IT-8 (1962), 21-28]. For 
further information see M. Mezard and A. Montanari, Information^ Physics，and 
Computation (2009), Chapters 14-22. 

^Preprocessing of clauses. A SAT-solving algorithm will often run consid¬ 
erably faster if its input has been transformed into an equivalent but simpler 
set of clauses. Such transformations and simplifications typically require data 
structures that would be inappropriate for the main work of a solver, so they are 
best considered separately. 

Of course we can combine a preprocessor and a solver into a single program; 
and “preprocessing” techniques can be applied again after new clauses have 
been learned, if we reach a stage where we want to clean up and start afresh. 
In the latter case the simplifications are called inprocessing. But the basic ideas 
are most easily explained by assuming that we just want to preprocess a given 
family of clauses F. Our goal is to produce nicer clauses F 1 ， which are satisfiable 
if and only if F is satisfiable. 

We shall view preprocessing as a sequence of elementary transformations 

F = Fq ^ Fi ^ F r = jP ’， (160) 

where each step Fj ^ i^ +1 “flows downhill” in the sense that it either (i) elim¬ 
inates a variable without increasing the number of clauses，or (ii) retains all the 
variables but decreases the number of literals in clauses. Many different downhill 
transformations are known; and we can try to apply each of the gimmicks in our 
repertoire, in some order, until none of them lead to any further progress. 

Sometimes we’ll actually solve the given problem, by reaching an F 1 that 
is either trivially satisfiable (0) or trivially unsatisfiable (contains e). But we 
probably won’t be so lucky unless F was pretty easy to start with, because we’re 
going to consider only downhill transformations that are quite simple. 

Before discussing particular transformations ， however, let’s think about the 
endgame: Suppose F has n variables but F 1 has n 1 < n. After we’ve fed the 
clauses F 1 into a SAT solver and received back a solution, x[ ... x' n , , how can 
we convert it to a full solution xi .. .x n of the original problem F? Here’s how: 
For every transformation Fj -> Fj^i that eliminates a variable Xk, we shall 
specify an erp rule (so-called because it reverses the effect of preprocessing). An 
erp rule for elimination is simply an assignment c l 4 - E\ where l is Xk or Xk^ 
and 五 is a Boolean expression that involves only variables that have not been 
eliminated. We undo the effect of elimination by assigning to Xk the value that 
makes l true if and only if E is true. 
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For example, suppose two transformations remove x and y with the erp rules 

x ^ y\/ y i- 1. 

To reverse these eliminations, right to left, we would set y true, then x i- z. 

As the preprocessor discovers how to eliminate variables，it can immediately 
write the corresponding erp rules to a file, so that those rules don’t consume 
memory space. Afterwards, given a reduced solution x[ ... x ' n ,, a postprocessor 
can read that file in reverse order and provide the unreduced solution x\ … x n . 

Transformation 1. Unit conditioning. If a unit clause ‘(/)’ is present, we can 
replace F by F\l and use the erp rule l 4 - 1. This elementary simplification will 
be carried out naturally by most solvers; but it is perhaps even more important 
in a preprocessor, since it often enables further transformations that the solver 
would not readily see. Conversely，other transformations in the preprocessor 
might enable unit conditionings that will continue to ripple down. 

One consequence of unit conditioning is that all clauses of F 1 will have length 
two or more, unless F 1 is trivially unsatisfiable. 

Transformation 2. Subsumption. If every literal in clause C appears also 
in another clause C\ we can remove C 1 . In particular, duplicate clauses will be 
discarded. No erp rule is needed, because no variable goes away. 

Transformation 3. Self-subsumption. If every literal in C except x appears 
also in another clause C’，where C 1 contains we can delete x from C 1 because 
C 1 \x = CoC f . In other words, the fact that C almost subsumes C 1 allows us at 
least to strengthen C 1 , without actually removing it. Again there’s no erp rule. 
[Self-subsumption was called “the replacement principle” by J. A. Robinson in 

JACM 12 (1965), 39.] 

Exercise 374 discusses data structures and algorithms by which subsump¬ 
tions and self-subsumptions can be discovered with reasonable efficiency. 

Transformation 4. Downhill resolution. Suppose x appears only in clauses 
Ci, … ， C p and x appears only in C[^ C’ q . We’ve observed (see ( 112 )) that 
variable x can be eliminated if we replace those p + g clauses by the pq clauses 
{C i oCj I 1 < i < p, 1 < j < g}. The corresponding erp rule (see exercise 367) is 

P Q 

or X -(r- (Cj \ x) . ( 161 ) 

i=i 

Every variable can be eliminated in this way，but we might be flooded with 
too many clauses. We can prevent this by limiting ourselves to “downhill” cases ， 
in which the new clauses don’t outnumber the old ones. The condition pq < p+q 
is equivalent to (p— l)(q — 1 ) < 1 , as noted above following ( 112 ); the variable is 
always removed in such cases. But the number of new clauses might be small even 
when pq is large, because of tautologies or subsumption. Furthermore, N. Een 
and A. Biere wrote a fundamental paper on preprocessing [LNCS 3569 (2005 )， 
61-75] that introduced important special cases in which many of the pq potential 
clauses can be omitted; see exercise 369. Therefore a preprocessor typically tries 
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to eliminate via resolution whenever min(p 5 q) < 10 ， say，and abandons the 
attempt only when more than p + q resolvents have been generated. 

Many other transformations are possible, although the four listed above have 
proved to be the most effective in practice. We could，for instance, look for failed 
literals: If unit propagation leads to a contradiction when we assume that some 
literal l is true (namely when F A (l) hi e), then we’re allowed to assume that 
l is false (because the unit clause (l) is certifiable). This observation and several 
others related to it were exploited in the lookahead mechanisms of Algorithm Y 
above. But Algorithm C generally has no trouble finding failed literals all by 
itself, as a natural byproduct of its mechanism for resolving conflicts. Exercises 
378-384 discuss other techniques that have been proposed for preprocessing. 

Sometimes preprocessing turns out to be dramatically successful. For exam¬ 
ple, the anti-maximal-element clauses of exercise 228 can be proved unsatisfiable 
via transformations 1—4 after only about 400 megamems of work when m = 50. 
Yet Algorithm C spends 3 gigamems on that untransformed problem when m is 
only 14; and it needs 11 Gfi when m = 15,…， failing utterly before m = 20. 

A more typical example arises in connection with Fig. 35 above: The problem 
of showing that there’s no 4-step path to LIFE involves 8725 variables，33769 
clauses, and 84041 literals，and Algorithm C requires about 6 gigamems to 
demonstrate that those clauses are unsatisfiable. Preprocessing needs less than 
10 megamems to reduce that problem to just 3263 variables，19778 clauses, and 
56552 literals; then Algorithm C can handle those with 5 Gfi of further work. 

On the other hand, preprocessing might take too long, or it might produce 
clauses that are more difficult to deal with than the originals. It’s totally useless 
on the waerden or langford problems. (Further examples are discussed below.) 

Encoding constraints into clauses. Some problems，like waerden k] n)，are 
inherently Boolean, and they’re essentially given to us as native-born ANDs of 
ORs. But in most cases we can represent a combinatorial problem via clauses in 
many different ways，not immediately obvious，and the particular encoding that 
we choose can have an enormous effect on the speed with which a SAT solver is 
able to crank out an answer. Thus the art of problem encoding turns out to be 
just as important as the art of devising algorithms for satisfiability. 

Our study of SAT instances has already introduced us to dozens of interesting 
encodings; and new applications often lead to further ideas, because Boolean 
algebra is so versatile. Each problem may seem at first to need its own special 
tricks. But we’ll see that several general principles are available for guidance. 

In the first place, different solvers tend to like different encodings: An 
encoding that’s good for one algorithm might be bad for another. 

Consider，for example, the at-most-one constraint, yi + • • • + < 1, which 

arises in a great many applications. The obvious way to enforce this condition 
is to assert (幻 binary clauses (扒 V 如）， for 1 < i < j < p ， so that yi = yj = 1 is 
forbidden; but those clauses become unwieldy when p is large. The alternative 
encoding in exercise 12， due to Marijn Heule, does the same job with only 
3p — 6 binary constraints when p > 3, by introducing a few auxiliary variables 
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ai, … ， a[( p _ 3 )/ 2 j - When we formulated Langford’s problem in terms of clauses ， 
via ( 12 ) ，（ 13 )，and ( 14 ) above，we therefore considered two variants called 
langford (n) and langford/ (n), where the former uses the obvious encoding of at- 
most-one constraints and the latter uses Heule’s method. Furthermore, exercise 
7.1.1-55(b) encoded at-most-one constraints in yet another way, having the same 
number of binary clauses but about twice as many auxiliary variables; let’s give 
the name langford 11 (n) to the clauses that we get from that scheme. 

We weren’t ready to discuss which of the encodings works better in practice, 
when we introduced langford (n) and langford/ (n) above，because we hadn’t yet 
examined any SAT-solving algorithms. But now we’re ready to reveal the answer; 
and the answer is: “It depends.” Sometimes langford 1 {n) wins over langford(n)' 
sometimes it loses. It always seems to beat langford"(n). Here，for example，are 
typical statistics, with runtimes rounded to megamems (M/x) or kilomems (K/x): 

variables clauses Algorithm D Algorithm L Algorithm C 


langford (9) 

104 

1722 

23M/x 

i6M/x 

15M/x 

(UNSAT) 

langford 1 (^) 

213 

801 

82 M// 

16 M// 

21M/x 

(UNSAT) 

langford" (9) 

335 

801 

139 Mfi 

20 M/x 

24 M// 

(UNSAT) 

langford (13) 

228 

5875 

71685 M// 

45744 M/x 

295571 M/x 

(UNSAT) 

langford 1 (13) 

502 

1857 

492992 M// 

38589 M/x 

677815 M/x 

(UNSAT) 

langford" (13) 

795 

1857 

950719 M/x 

46398 M/x 

792757 M/x 

(UNSAT) 

langford (16) 

352 

11494 

5M/i 

52 M/x 

301K// 

(SAT) 

langford 1 (16) 

796 

2928 

12M/x 

31M/x 

418 K// 

(SAT) 

langford," (16) 

1264 

2928 

20M/x 

38 Mfi 

510 K/x 

(SAT) 

langford (64) 

6016 

869650 

(huge) 

(bigger) 

35M/x 

(SAT) 

langford 1 ^) 14704 

53184 

(huger) 

(big) 

73M// 

(SAT) 

langford"(6 23488 

53184 

(hugest) 

(biggest) 

304 M/x 

(SAT) 


Algorithm D prefers langford to langford 1 ， because it doesn’t perform unit prop¬ 
agations very efficiently. Algorithm L，which excels at unit propagation, likes 
langford 1 better. Algorithm C also excels at unit propagation, but it exhibits 
peculiar behavior: It prefers langford ， and on satisfiable instances it zooms in 
quickly to find a solution; but for some reason it runs very slowly on unsatisfiable 
instances when n > 10. 

Another general principle is that short encodings — encodings with few vari¬ 
ables and/or few clauses — are not necessarily better than longer encodings. 
For example，we often need to use Boolean variables to encode the value of a 
variable x that actually ranges over d > 2 different values, say 0 < x < d. In 
such cases it’s natural to use the binary representation x = (xi-i …: ^ 0 ) 2 , where 
l = 「 lgd]，and to construct clauses based on the independent bits Xj] but that 
representation, known as the log encoding, surprisingly turns out to be a bad idea 
in many cases unless d is large. A direct encoding with d binary variables 
… ， Xd-i, where Xj = [x = j], is often much better. And the order encoding 
with d — 1 binary variables x 1 ^ • • • ， x d-1 ， where = [x>j]^ is often better 
yet; this encoding was introduced in 1994 by J. ML Crawford and A. B. Baker 
[AAAI Conf. 12 (1994) ， 1092-1097]. In fact，exercise 408 presents an important 


Langford’s problem 

langford" (n) 

unit propagations 
binary representation 
log encoding 
direct encoding 

sparse encoding, see direct encoding 

order encoding 

Crawford 

Baker 


September 23, 2015 



7 . 2 . 2.2 SATISFIABILITY: ENCODING CONSTRAINTS INTO CLAUSES 99 

application where the order encoding is the method of choice even when d is 1000 
or more! The order encoding is exponentially larger than the log encoding，yet it 
wins in this application because it allows the SAT solver to deduce consequences 
rapidly via unit propagation. 

Graph coloring problems illustrate this principle nicely. When we tried early 
in this section to color a graph with d colors，we encoded the color of each vertex 
with a direct representation ， ( 15 ); but we could have used binary notation for 
those colors. And we could also have used the order encoding，even though 
the numerical ordering of colors is irrelevant in the problem itself. With a log 
encoding，exercise 391 exhibits three distinct ways to enforce the constraint that 
adjacent vertices have different colors. With the order encoding, exercise 395 
explains that it’s easy to handle graph coloring. And there also are four ways 
to work with the direct encoding，namely (a) to insist on one color per vertex 
by including the at-most - one exclusion clauses ( 17 ); or (b) to allow multivalued 
(multicolored) vertices by omitting those clauses; or (c) to actually welcome 
multicolored vertices, by omitting ( 17 ) and forcing each color class to be a kernel ， 
as suggested in answer 14; or (d) to include ( 17 ) but to replace the “preclusion” 
clauses ( 16 ) by so-called “support” clauses as explained in exercise 399. 

These eight options can be compared empirically by trying to arrange 64 
colored queens on a chessboard so that no queens of the same color appear in 
the same row, column, or diagonal. That task is possible with 9 colors, but not 
with 8 . By symmetry we can prespecify the colors of all queens in the top row. 


encoding 

colors 

variables 

clauses 

Algorithm L Algorithm C 


uni valued 

8 

512 

7688 

3333 M/x 

9813 M/x 

(UNSAT) 

multivalued 

8 

512 

5896 

1330 Mfi 

11997 M/x 

(UNSAT) 

kernel 

8 

512 

6408 

4196 Mfi 

12601M// 

(UNSAT) 

support 

8 

512 

13512 

16796 M 11 

20990 M/x 

(UNSAT) 

log ⑷ 

8 

2376 

5120 

(immense) 

20577 M// 

(UNSAT) 

log(b) 

8 

192 

5848 

(enormous) 

15033 Mfi 

(UNSAT) 

log(c) 

8 

192 

5848 

(enormous) 

15033 M/x 

(UNSAT) 

order 

8 

448 

6215 

43615 M/x 

5122 M// 

(UNSAT) 

uni valued 

9 

576 

8928 

2907 M/x 

464 M// 

(SAT) 

multivalued 

9 

576 

6624 

104 M/x 

401 M/x 

(SAT) 

kernel 

9 

576 

7200 

93M/x 

87 M// 

(SAT) 

support 

9 

576 

15480 

2103 M/x 

613 M// 

(SAT) 

log ⑷ 

9 

3168 

6776 

(gigantic) 

1761 M/x 

(SAT) 

log(b) 

9 

256 

6776 

(colossal) 

1107M// 

(SAT) 

log(c) 

9 

256 

6584 

(mammoth) 

555 Mfi 

(SAT) 

order 

9 

512 

7008 

(monstrous) 

213 M/x 

(SAT) 


(Each running time shown here is the median of nine runs, made with different 
random seeds.) It’s clear from this data that the log encodings are completely 
unsuitable for Algorithm L; and even the order encoding confuses that algo¬ 
rithm^ heuristics. But Algorithm L shines over Algorithm C with respect to 
most of the direct encodings. On the other hand, Algorithm C loves the order 
encoding, especially in the difficult unsatisfiable case. 
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And that’s not the end of the story. H. Tajima [M.S. thesis，Kobe University 
(2008)] and N. Tamura noticed that order encoding has another property，which 
trumps all other encodings with respect to graph coloring: Every ^-clique of 
vertices {v\ ，…， 仰 } in a graph allows us to append two additional “hint clauses” 

{vf~ kJrl V ... V vf~ kJrl ) A {v^~ l V ... V vl~ l ) (162) 

to the clauses for d-coloring — because some vertex of the clique must have a 
color < d—k^ and some vertex must have a color > A: — 1. With these additional 
clauses, the running time to prove unsatisfiability of the 8 -coloring problem drops 
drastically to just 60 Mfi with Algorithm L，and to only 13 Mfi with Algorithm C. 
We can even reduce it to just 2 M/x(!) by using that idea twice (see exercise 396). 

The order encoding has several other nice properties, so it deserves a closer 
look. When we represent a value x in the range 0 < x < dby the binary variables 
= [x >j] for 1 < j < we always have 

X = x 1 x 2 X d ~ l ] (163) 

hence order encoding is often known as unary representation. The axiom clauses 

(^ J+1 V x^) for 1 < j < d — 1 (164) 

are always included，representing the fact that x > j + l implies x > j for each j ; 

these clauses force all the Is to the left and all the 0s to the right. When d = 2 

the unary representation reduces to a one-bit encoding equal to x itself; when 

d = 3 it’s a two-bit encoding with 00, 10, and 11 representing 0, 1， and 2. 

# 

We might not know all of the bits x J of x^s unary encoding while a problem 
is in the course of being solved. But if we do know that, say, x 3 = 1 and x 7 = 0, 
then we know that x belongs to the interval [3.. 7). 

Suppose we know the unary representation of x. Then no calculation is 
necessary if we want to know the unary representation of y = $ + a，when a is a 
constant, because = x^~ a . Similarly, z = a — x is equivalent to = x a+1_J ; 
and w = L$/a」is equivalent to = x aj • Out-of-bounds superscripts are easy to 
handle in formulas such as this, because x 1 — 1 when i <0 and x 1 — 0 when i > d. 
The special case x = d — 1 — x is obtained by left-right reflection of x 1 ... x^~ l \ 

(d — 1 — xy = (xy = x d ~^ . ( 165 ) 

If we are using the order encoding for two independent variables x and y ， 
with 0 < x^y < it’s similarly easy to encode the additional relation x <y-\- a: 

min(d—l ， d+a) 

x — y < a x <y a 八 [x^ V (166) 

j—max(0,a+l) 

And there are analogous ways to place bounds on the sum, x + y: 

min(d—l ， a+l) 

x y < a x<y-\-a-\-l~d 八 [x^ V y a+1_J ); (167) 

jf—max(0,a+2 —d) 


Tajima 

Tamura 

clique 

hint clauses 
unary representation 
axiom clauses 
ternary numbers 

complementation of unary representation 


September 23, 2015 



7.2.2.2 SATISFIABILITY: ENCODING CONSTRAINTS INTO CLAUSES 101 

min(d ， a) 

x y > a x<y — a — l-\-d 4=^ 八 {x^ V y a+1_J ). ( 168 ) 

jf—max(l,a + l—d) 

In fact, exercise 405 shows that the general condition ax + by < c can be enforced 
with at most d binary clauses, when a ， 6 , and c are constant. Any set of such rela¬ 
tions, involving at most two variables per constraint，is therefore a 2SAT problem. 

Relations between three or more order-encoded variables can also be handled 
without difficulty, as long as d isn’t too large. For example, conditions such as 
x + y < z and x + y > z can be expressed with 0(d log d) clauses of length 
< 3 (see exercise 407). Arbitrary linear inequalities can also be represented ， 
in principle. But of course we shouldn’t expect SAT solvers to compete with 
algebraic methods on problems that are inherently numerical. 

Another constraint of great importance in the encoding of combinatorial 
problems is the relation of lexicographic order: Given two bit vectors x\ ... x n 
and yi .. .y n , we want to encode the condition (x\ .. .x n )2 < (yi • • .y n )2 as a 
conjunction of clauses. Fortunately there’s a nice way to do this with just 3n — 2 
ternary clauses involving n — 1 auxiliary variables ai, … ， a n _i，namely 

n —1 

八 ((x k Vy k Va k -i) A (x k Va k Vd k -i) A (y k Va k Vd k -i)) A (^ n Vy n Va n _i), ( 169 ) 

where c ao is omitted. For example, the clauses 


(xiVyi)A(^iVai)A(yiVai)A(^ 2 V^2Vai) A(x 2 Va2Vai) A(y2Va2Vai)A(x3Vy 3 Va2) 


assert that ^ 1 X 2^3 < V\V 2 Vz* And the same formula, but with the final term 
(x n V y n V a n -i) replaced by (x n V a n _i) A (y n V a n _i), works for the strict 
comparison x\ .. .x n < ?/i • • • These formulas arise by considering the carries 
that occur when (^1 ... x n ) 2 + (l or 0) is added to (yi • • • y n ) 2 . (See exercise 414.) 

The general problem of encoding a constraint on the Boolean variables 
xi^ ^ x n is the question of finding a family of clauses F that are satisfiable 
if and only if f(xi^x n ) is true, where / is a given Boolean function. We 
usually introduce auxiliary variables ai, … ， a m into the clauses of F, unless / 
can be expressed directly with a short CNF formula; thus the encoding problem 
is to find a “good” family F such that we have 

= 1 彐 CM... 彐 f\ C ， ( 170 ) 

ceF 

where each (7 is a clause on the variables {ai ， …， a m ^x\ ， … ， x n }. The variables 
ai ，• • 。 a m can be eliminated by resolution as in ( 112 )，at least in principle ， leav¬ 
ing us with a CNF for / — although that CNF might be huge. (See exercise 248.) 

If there’s a simple circuit that computes /， we know from ( 24 ) and exercise 42 
that there’s an equally simple “Tseytin encoding” F，with one auxiliary variable 
for each gate in the circuit. For example，suppose we want to encode the 
condition x\ .. .x n / yi ...• The shortest CNF expression for this function 
f(xi, … ， y n ) has 2 n clauses (see exercise 415); but therea simple 
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circuit (Boolean chain) with just n + 1 gates: 

勿 卜 ： © 扒 ，••” a n 4 - x n Qy n , / 卜勿 V • • • V a n . 

Using ( 24 ) we get the An clauses 

n 

八 （(％ v % v %•) A (巧 V % V %) A (〜• V % V %) A (%• V % V %)), ( 171 ) 

i-i 

together with (ai V … V a n ), as a representation of c x\ ... x n / yi ... yn • 

But this is overkill; D. A. Plaisted and S. Greenbaum have pointed out 
[Journal of Symbolic Computation 2 (1986) ， 293-304] that we can often avoid 
about half of the clauses in such situations. Indeed，only 2n of the clauses ( 171 ) 
are necessary (and sufficient)，namely the ones involving ay. 

n 

八((巧 V 的 V %) 八 （％ V % V %)). ( 172 ) 

j 二 1 

The other clauses are “blocked” (see exercise 378) and unhelpful. Thus it’s a 
good idea to examine whether all of the clauses in a Tseytin encoding are really 
needed. Exercise 416 illustrates another interesting case. 

An efficient encoding is possible also when / has a small BDD，and in general 
whenever / can be computed by a short branching program. Recall the example 
“Pi function” introduced in 7.1.1—( 22 ); we observed in 7.1.2-(6) that it can be 
written (((x 2 A ^ 4 ) ㊉ 龙 3 ) 八 无 i) ㊉ X 2 - Thus it has a 12-clause Tseytin encoding 

(^Vai) A (x^di) A (^V^Vai) A (^3VaiVa2) A (^3VaiVa2) A (^3VaiVa2) 

A (x 3 VaiVa 2 ) A (xiWds) A (( 22 V 如 ） A (xiW^Was) A (x 2 Va 3 ) A (^Vd 3 ). 

The Pi function also has a short branching program ， 7.1.4—( 8 )，namely 

h = (l?7:6 ),/ 7 - (2?5:4 ),/ 6 - (2?0:l ),/ 5 - (3? 1:0), 

h = (3?3:2 ),/ 3 = (4?l:0 ),/ 2 = (4? 0:1), 

where the instruction c (v? l:h)’ means u If x v = 0 5 go to //, otherwise go to Ih,” 
except that Iq and h unconditionally produce the values 0 and 1. We can 
convert any such branching program into a sequence of clauses, by translating 
c Ij = (v? 1 : hy into 

(aj V ^ V ai) A (dj W x v W a h ), ( 173 ) 

where ao is omitted, and where any clauses containing a\ are dropped. We 
also omit 屯 ， where I t is the first instruction; in this example t = 8 . (These 
simplifications correspond to asserting the unit clauses (ao) A (ai) A (a t ).) The 
branching program above therefore yields ten clauses ， 

(^iV a^) A (^iV ae) A {a^M ^2 V a^) A (« 7 V ^2 V a^) A (a^W X 2 ) 

A («5 V xs) A («4 V X 3 V as) /\ (a 4 V 无 3 V a 2 ) A (a^W x 4 ) A (a 2 V X 4 ). 

We can readily eliminate a 6 , a 5 , < 23 , < 22 , thereby getting a six-clause equivalent 

(xiV a^) A ( 无 iV^ 2 ) A (« 7 Va ： 2 V^ 3 ) A (« 7 V^ 2 V a^) A ( 知 V $3 V 无 4 ) A (« 4 VX 3 V^ 4 ); 
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and a preprocessor will simplify this to the four-clause CNF 

(xiW X2) A (X2VX3) A (xiW X2\/ xsW X4) A (xiW X4), (174) 

which appeared in exercise 7.1.1-19. 

Exercise 417 explains why this translation scheme is valid. The method 
applies to any branching program whatsoever: The x variables can be tested in 
any order — that is，the r’s need not be decreasing as in a BDD; moreover, a 
variable may be tested more than once. 

Unit propagation and forcing. The effectiveness of an encoding depends 
largely on how well that encoding avoids bad partial assignments to the vari¬ 
ables. If we’re trying to encode a Boolean condition /($i ， $ 2 , • • • ，冗 n)，and if the 
tentative assignments xi ^ 1 and X 2 0 cause / to be false regardless of the 
values of xs through x n , we’d like the solver to deduce this fact without further 
ado, ideally by unit propagation once x\ and X 2 have been asserted. With a 
CDCL solver like Algorithm C，a quickly recognized conflict means a relatively 
short learned clause — and that’s a hallmark of progress. Even better would be 
a situation in which unit propagation, after asserting would already force X 2 
to be true; and furthermore if unit propagation after x 2 would also force x\. 

Such scenarios aren’t equivalent to each other. For example，consider the 
clauses F = (xiW xs) A (xiW xj x^). Then，using the notation hi T to signify 
that F leads to l via unit propagation, we have F \ x\ hi X 2 ^ but F \x 2 l/i 龙 i. 
And with the clauses G = (xi V X 2 V ^ 3 ) A {x\ V 奶 V 无 3 ) we have G\x\\x 2 l~i e 
(see Eq. ( 119 ))，but G\x\ t/i and G |^2 ^1 • 

Consider now the simple at-most-one constraint on just three variables ， 
f(x\^X 2 ^xs) = [x\ + X 2 + xs <1]. We can try to represent / by proceeding 
methodically using the methods suggested above, either by constructing a circuit 
for / or by constructing /’s BDD. The first alternative (see exercise 420) yields 

F = (xiV^ 2 Vai) A (^iVa：2Vai) A (xiW X2V a\) A (xi\/ X2) A (无 3V ai); (175) 

the second approach (see exercise 421) leads to a somewhat different solution ， 

G = ($iV (24) 八(无 i V a3) 八 (a^ 无 2Va2) 八 (dsWX2N a^) l\ (a^\Jx^) f\ (&2V^3). (176) 

But neither of these encodings is actually very good，because F\x^ l/i x\ and 
G\x^ \/i x\. Much better is the encoding that we get from the general scheme 
of ( 18 ) and ( 19 ) in the case n = 3, r = 1， namely 

S = (aiV 02) A (xiM ai) A (办 V a2) 八 (X2V a\) A (无 3V & 2)， (177) 

where a\ and stand for s\ and or the one obtained from ( 20 ) and ( 21 )， 

B = (xs\/ a\) A (^ 2 V ai) A (^ 2 V ^3) A (aiV 无 1)， (178) 

where a\ stands for With either ( 177 ) or ( 178 ) we have S \ Xi hi Xj and 
B \xi hi Xj by unit propagation whenever i ♦And of course the obvious 
encoding for this particular / is best of all, because n is so small: 

o = {xiW x 2 ) A (^1V ^3) A (x 2 \/ Xs). (179) 
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Suppose ^x n ) is a Boolean function that’s represented by a family 

of clauses F, possibly involving auxiliary variables {ai ， … ， a m }，as in ( 170 ). We 
say that F is a forcing representation if we have 

F\L h l implies F\L hi l ( 180 ) 

whenever L U Ms a set of strictly distinct literals contained in {$i ， …• ， : r n ， 
龙 1 ， … ，龙 n}. In other words，if the partial assignment represented by L logically 
implies the truth of some other literal we insist that unit propagation alone 
should be able to deduce l from F | L. The auxiliary variables {ai ， … ， a m } 
are exempt from this requirement; only the potential forcings between primary 
variables {x\^ •••，〜} are supposed to be recognized easily when they occur. 

( Technical point: If F | L h e, meaning that F \ L is imsatisfiable，we 
implicitly have F | L h / for all literals l. In such a case ( 180 ) tells us that 
F I L hi l and F | L hi l both hold; hence F \ L can then be proved unsatisfiable 
by unit propagation alone.) 

We’ve seen that the clauses S and B in ( 177 ) and ( 178 ) are forcing for the 
constraint [x\ + + ^3 < 1], but the clauses F and G in ( 175 ) and ( 176 ) are not. 

In fact，the clauses of ( 18 ) and ( 19 ) that led to ( 177 ) are always forcing，for the 

general cardinality constraint [x\ H - + x n < r]; and so are the clauses of ( 20 ) 

and ( 21 ) that led to ( 178 ). (See exercises 429 and 430.) Moreover，the general 
at-most-one constraint [xi + • • • + x n < 1 ] can be represented more efficiently by 
Heule’s 3(n —2) binary clauses and L( n —3)/2」auxiliary variables (exercise 12)，or 
with about nlgn binary clauses and only 卩 gn] auxiliary variables (exercise 394); 
both of these representations are forcing. 

In general ， we’re glad to know as soon as possible when a variable’s value has 
been forced by other values, because the variables of a large problem typically 
participate in many constraints simultaneously. If we know that x can’t be 0 in 
constraint /， then we can often conclude that some other variable y can’t be 1 in 
some other constraint g/if x appears in both / and g. There’s lots of feedback. 

On the other hand it might be worse to use a large representation F that 
is forcing than to use a small representation G that isn’t, because additional 
clauses can make a SAT solver work harder. The tradeoffs are delicate，and 
they’re difficult to predict in advance. 

Every Boolean constraint f(x\ ， ••” x n ) has at least one forcing represen¬ 
tation that involves no auxiliary variables. Indeed, it’s easy to see that the 
conjunctive prime form F of f — the AND of all /’s prime clauses — is forcing. 

Smaller representations are also often forcing，even without auxiliaries. For 
example, the simple constraint [x\ >X 2 > — > x n ] has ⑵ prime clauses，namely 
(xj V Xk) for 1 < j < k < n; but only n — 1 of those clauses, the cases when 
A: = j + 1 as in ( 164 )，are necessary and sufficient for forcing. Exercise 424 
presents another, more-or - less random example. 

In the worst case, all forcing representations of certain constraints are known 
to be huge, even when auxiliary variables are introduced (see exercise 428). 
But exercises 431—441 discuss many examples of useful and instructive forcing 
representations that require relatively few clauses. 
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WeVe glossed over an interesting technicality in definition ( 180 ) ， however: 
A sneaky person might actually construct a representation F that is absolutely 
useless in practice, even though it meets all of those criteria for forcing. For exam¬ 
ple, let G(a\^ … ， a m ) be a family of clauses that are satisfiable — but only when 
the auxiliary variables aj are set to extremely hard - to-find values. Then we might 
have f(xi) = x\ and F = (xi) AG(ai^... ， a m )(!). This defect in definition ( 180 ) 
was first pointed out by M. GWynne and O. Kullmann [arXiv ： 1406.7398 [cs-CC] 
(2014)，67 pages]，who have also traced the history of the subject. 

To avoid such a glitch, we implicitly assume that F is an honest represen¬ 
tation of /， in the following sense: Whenever L is a set of n literals that fully 
characterizes a solution xi ... x n to the constraint f(xi ， … ， x n ) = 1 ， the clauses 
F I L must be easy to satisfy, using the SLUR algorithm of exercise 444. That 
algorithm is efficient because it does not backtrack. All of the examples in exer¬ 
cises 439-444 meet this test of honesty; indeed, the test is automatically passed 
whenever every clause of F contains at most one negated auxiliary variable. 

Some authors have suggested that a SAT solver should branch only on 
primary variables rather than on auxiliary variables aj, whenever possible. 
But an extensive study by M. Jarvisalo and I. Niemela [LNCS 4741 (2007), 
348-363; J. Algorithms 63 (2008), 90-113] has shown that such a restriction is 
not advisable with Algorithm C, and it might lead to a severe slowdown. 

Symmetry breaking. Sometimes we can achieve enormous speedup by exploit¬ 
ing symmetries. Consider, for example，the clauses for placing m + 1 pigeons into 
m holes, ( 106 )-( 107 ). We’ve seen in Lemma B and Theorem B that Algorithm C 
and other resolution-related methods cannot demonstrate the unsatisfiability of 
those clauses without performing exponentially many steps as m grows. However ， 
the clauses are symmetrical with respect to pigeons; independently, they’re also 
symmetrical with respect to holes: If 丌 is any permutation of {0, 1，••” m} and 
if p is any permutation of { 1 ， 2 , • • • ， m}，the transformation Xjk ^ x (jn)(kp) f° r 
0 < j < m and 1 < k < m leaves the set of clauses ( 106 )-( 107 ) unchanged. 
Thus the pigeonhole problem has (m + l)!m! symmetries. 

We’ll prove below that the symmetries on the holes allow us to assume safely 
that the hole-occupancy vectors are lexicographically ordered, namely that 

^Ok^lk - - - ^mk ^ $0 (众 +1)$1 (众 +1) • • • (众 +1) ， 【Or \ ^ k <C. T71. (l 8 l) 

These constraints preserve satisfiability; and we know from ( 169 ) that they are 
readily expressed as clauses. Without the help of such additional clauses the 
running time of Algorithm C rises from 19 megamems for m = 7 to 177 M/x for 
m = 8 , and then to 3.5 gigamems and 86 Gfi for m = 9 and 10. But with ( 181 )， 
the same algorithm shows unsatisfiability for m = 10 after only 1 megamem; and 
for m = 20 and m = 30 after only 284 Mfi and 3.6 G// ， respectively. 

Even better results occur when we order the p^eon-occupancy vectors: 

^jl^j 2 • • • ^jm ^ $(j + l) 1 $(jf+l )2 • • 【Or 0 ^ J < 777 /. ( 182 ) 

With these constraints added to ( 106 ) and ( 107 )，Algorithm C polishes off the 
case m = 10 in just 69 kilomems. It can even handle m = 100 in 133 Mfi. This 
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remarkable improvement was achieved by adding only m 2 —m new variables and 
3m 2 —2m new clauses to the original m 2 +m variables and (m +1) + (m 3 + m 2 ) /2 
clauses of ( 106 ) and ( 107 ). (Moreover, the reasoning that justifies ( 182 ) doesn’t 
“cheat” by invoking the mathematical pigeonhole principle behind the scenes.) 

Actually that’s not all. The theory of columnwise symmetry (see exer¬ 
cise 498) also tells us that we’re allowed to add the ⑺ simple binary clauses 

(^_ 1 ^ V for 1 <j <k <m ( 183 ) 

to ( 106 ) and ( 107 )，instead of ( 182 ). This principle is rather weak in general; but 
it turns out to be ideally suited to pigeons: It reduces the running time for m = 
100 to just 21 megamems，although it needs no auxiliary variables whatsoever! 

Of course the status of ( 106 )-( 107 ) has never been in doubt. Those clauses 
serve merely as training wheels because of their simplicity; they illustrate the 
fact that many symmetry - breaking strategies exist. Let’s turn now to a more 
interesting problem, which has essentially the same symmetries，but with the 
roles of pigeons and holes played by “points” and “lines” instead. Consider a set 
of m points and n lines, where each line is a subset of points; we will require that 
no two points appear together in more than one line. (Equivalently, no two lines 
may intersect in more than one point.) Such a configuration may be called quad- 
free^ because it is equivalent to an m x n binary matrix (xij) that contains no 
“quad，” namely no 2 x 2 submatrix of Is; element means that point i belongs 
to line j. Quad-free matrices are obviously characterized by (?) (^) clauses ， 

(x^ V Xiy V x^j V for 1 < i < i f < m and 1 < j < j f < n. ( 184 ) 

What is the maximum number of Is in an m x n quad-free matrix? [This 
question，when m = n，was posed by K. Zarankiewicz ， Colloquium Mathematics 
2 (1951) ， 301， who also considered how to avoid more general submatrices of Is. 
Let’s call that value Z(m^n) — 1; then Z(m^n) is the smallest r such that every 
m x n matrix with r nonzero entries contains a quad. 

We’ve actually encountered examples of this problem before, but in a dis¬ 
guised form. For example (see exercise 448)，a Steiner triple system on v objects 
exists if and only if v is odd and there is a quad-free matrix with m = v, 
n = v(v — 1)/6，and r = v(v — 1)/2. Other combinatorial block designs have 
similar characterizations. 

Table 5 shows the values of Z(m ， n) for small cases. These values were dis¬ 
covered by delicate combinatorial reasoning，without computer assistance; so it’s 
instructive to see how well a SAT solver can compete against real intelligence. 

The first interesting case occurs when m = n = 8 : One can place 24 
markers on a chessboard without forming a quad，but Z(8^ 8 ) = 25 markers is too 
many. If we simply add the cardinality constraints YllLi x ij > ^ to ( 184 )， 

Algorithm C will quickly find a solution when m = n = 8 and r = 24. But it 
bogs down when r = 25, requiring about 10 teramems to show unsatisfiability. 

Fortunately we can take advantage of ml n\ symmetries，which permute rows 
and columns without affecting quads. Exercise 495 shows that those symmetries 
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Z(m ， n)，THE MINIMUM NUMBER OF IS WITH (184) UNSATISFIABLE 
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[References: R- K. Guy，in Theory of Graphs, Tihany 1966, edited by Erdos and Katona (Aca¬ 
demic Press, 1968) ， 119-150; R. J. Nowakowski ， Ph.D. thesis (Univ. of Calgary, 1978) ， 202.] 
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allow us to add the lexicographic constraints 

^il ^i2 • • • ^in ^ $(i+l)l$(i+l)2 • • • $(i+l)n ， for 1 ^ i < 771 ， (185) 

X\jX2j - - . X m j ^ ^l(jf+l)^2(jf + l) * # • ^m(jf+l) •)【Or 1 ^ J <77/. (l86) 

(Increasing order, with in place of could also have been used, but 
decreasing order turns out to be better; see exercise 497.) The running time to 
prove unsatisfiability when r = 25 now decreases dramatically, to only about 50 
megamems. And it falls to 48 Mfi if the lexicographic constraints are shortened 
to consider only the leading 4 elements of a row or column, instead of testing all 8 . 

The constraints of ( 185 ) and ( 186 ) are useful in satisfiable problems too — 
not in the easy case m = n = 8 , when they aren’t necessary, but for example in 
the case m = n = 13 when r = 52: Then they lead Algorithm C to a solution 
after about 200 gigamems，while it needs more than 18 teramems to find a 
solution without such help. (See exercise 449.) 

Satisfiability-preserving maps. Let’s proceed now to the promised theory of 
symmetry breaking. In fact, we will do more: Symmetry is about permutations 
that preserve structural properties，but we will consider arbitrary mappings 
instead. Mappings are more general than permutations, because they needn’t be 
invertible. If x — x\ ... x n is any potential solution to a satisfiability problem ， 
our theory is based on transformations r that map x xr = x[ ... ^ where 

xt is required to be a solution whenever x is a solution. 

In other words，if F is a family of clauses on n variables and if f(x )= 
[x satisfies _F]，then we are interested in all mappings r for which f(x) < f (xt) • 
Such a mapping is conventionally called an endomorphism of the solutions.* If an 

* This word is a bit of a mouthful- But it’s easier to say “endomorphism” than to say 
“satisfiability-preserving transformation,” and you can use it to impress your friends- The 
term “conditional symmetry” has also been used by several authors in special cases. 
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endomorphism r is actually a permutation, it’s called an automorphism. Thus, 
if there are K solutions to the problem，out oi N = 2 n possibilities，the total 
number of mappings is N n ' the total number of endomorphisms is K k N n ~ k ] 
and the total number of automorphisms is K\ (N — K)\. 

Notice that we don’t require f(x) to be exactly equal to f (xt) • An endomor¬ 
phism is allowed to map a nonsolution into a solution，and only K k (N — K) n ~ k 
mappings satisfy that stronger property. On the other hand, automorphisms 
always do satisfy f(x) = /(xr); see exercise 454 . 

Here，for instance，is a more - or-less random mapping when n = 4 : 


1100 


0001—0010 — 0100 — 1111 —1101 - >0000 


0011 ^ 


1011 -> 1010->0101 4 0110 


0111 


1001 




(187) 


Exercises 455 and 456 discuss potential endomorphisms of this mapping. 

In general there will be one or more cycles, and every element of a cycle is 
the root of an oriented tree that leads to it. For example, the cycles of (187) are 
(0011), (1010 0101 0110), and (1000). 

Several different endomorphisms Ti, T2， •…， are often known. In such 
cases it’s helpful to imagine the digraph with 2 n vertices that has arcs from each 
vertex x to its successors XT2, •… ， xr p . This digraph will have one or more 
sink components, which are strongly connected components Y from which there 
is no escape: If x then xr^ G F for 1 < ^ < p. (In the special case where each 
r/c is an automorphism, the sink components are traditionally called orbits of the 
automorphism group.) When p = 1 ， a sink component is the same as a cycle. 

The clauses F are satisfiable if and only if f(x) = 1 for at least one x. Such 
an x will lead to at least one sink component all of whose elements will satisfy 
f(y) = 1 . Thus it suffices to test satisfiability by checking just one element y in 
every sink component F, to see if f(y) = 1. 

Let’s consider a simple problem based on the “sweep” of an mxn matrix X = 
(Xij), which is the largest diagonal sum of any t x t submatrix: 


sweep (X) 


v ( x nh + x %2j2 + * * • + x hjt)- 

^<ji<j2< •- <jt<n 


(188) 


When X is binary ， sweep(X) is the length of the longest downward-and-rightward 


path that passes through its Is. We can use satisfiability to decide whether such 

a matrix exists having sweep(X) < k and x ij ^ given ooooiiiin 

m, n, k, and r: suitable clauses are exhibited in exercise 460 . A ooooiooon 

solution with m = n = 10 , A: = 3 , and r = 51 appears at the right: It 0001101101 


has 51 ls，but no four of them lie in a monotonic southeasterly path. 


0111111001 

1111100001 


This problem has 2 mn candidate matrices X， and experiments 
with small m and n suggest several endomorphisms that can be 
applied to such candidates without increasing the sweep. 


1010000011 

1010000010 

1110111110 

1111100000 


• ri: If Xij = 1 and : ro+i) = 0 ， and if x^j = 0 for 1 < i 1 < we can set 


Xij <— 0 and ^i(j+i) <— 1. 

• T2 ： If Xij = 1 and = 0 , and if Xiy — 0 for 1 < j 1 < j, we can set 

x^j i ~ 0 and - ^~ 1 - 
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• T 3 ： If the 2x2 submatrix in rows {i^i + 1} and columns {j^j + 1} is we 

can change it to • 

These transformations are justified in exercise 462. They’re sometimes applicable 
for several different i and j ; for instance, T 3 could be used to change any of eight 
different 2x2 submatrices in the example solution. In such cases we make an ar¬ 
bitrary decision，by choosing (say) the lexicographically smallest possible i and j. 

The clauses that encode this problem have auxiliary variables besides 
but we can ignore the auxiliary variables when reasoning about endomorphisms. 

Each of these endomorphisms either leaves X unchanged or replaces it by a 
lexicographically smaller matrix. Therefore the sink components of {ti ， T 2 ， T 3 } 
consist of the matrices X that are fixed points of all three transformations. Hence 
we’re allowed to append additional clauses, stating that neither t\ nor 乃 nor ts 
is applicable. For instance, transformation ts is ruled out by the clauses 

m —1 n — 1 

i^l j 二 1 

which state that the submatrix doesn’t appear. The clauses for n and T 2 are 
only a bit more complicated (see exercise 461). 

These additional clauses give interesting answers in satisfiable instances ， 
although they aren’t really helpful running-time-wise. On the other hand, they’re 
spectacularly successful when the problem is wnsatisfiable. 

For example, we can show，without endomorphisms, that the case m — n — 
10，& = 3，r = 52is impossible, and hence that any solution for r = 51 is 
optimum; Algorithm C proves this after about 16 gigamems of work. Adding 
the clauses for n and T 2 ，but not T 3 ，increases the running time to 23 Gfi ； on 
the other hand the clauses for rs without T\ or r 2 reduce it to 6 Gfi. When we 
use all three endomorphisms simultaneously, however, the running time to prove 
unsatisfiability goes down to just 3.5 me"amems，a speedup of more than 4500. 

Even better is the fact that the fixed points of {ri^T 2 ^rs} actually have an 
extremely simple form — see exercise 463 — from which we can readily determine 
the answer by hand，without running the machine at all! Computer experiments 
have helped us to guess this result; but once weVe proved it, weVe solved 
infinitely many cases in one fell swoop. Theory and practice are synergistic. 

Another interesting example arises when we want to test whether or not 
a given graph has a perfect matching, which is a set of nonoverlapping edges 
that exactly touch each vertex. We’ll discuss beautiful，efficient algorithms for 
this problem in Sections 7.5.1 and 7.5.5; but it’s interesting to see how well a 
simple-minded SAT solver can compete with those methods. 

Perfect matching is readily expressible as a SAT problem whose variables are 
called c uv\ one for each edge u —— v. Variables c uv’ and c vu’ are identical. When¬ 
ever the graph contains a 4-cycle ^ 0 —— v\ —— V 2 —— vg —— 吻 ， we might include 
two of its edges {^ 0^1 ^ 2 ^ 3 } in the matching; but we could equally well have in¬ 
cluded instead. Thus there’s an endomorphism that says，“If ^ 0^1 = 

V 2 V 3 = 1 (hence V 1 V 2 = V 3 V 0 = 0 )，set vqVi <— V 2 V 3 <— 0 and V 1 V 2 <— V 3 V 0 4 - 1 .” 
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And we can carry this idea further: Let the edges be totally ordered in some 
arbitrary fashion, and for each edge uv consider all 4-cycles in which uv is the 
largest edge. In other words，we consider all cycles of the form u —— v —— u’ —— 
v’ —— u in which vu’ ， u’v ， ， v l u all precede uv in the ordering. If any such cycles 
exist, choose one of them arbitrarily，and let r uv be one of two endomorphisms: 

r~ v : u If uv = u'v 1 — 1 , set uv 4 - u l v' 0 and vu’ <- v’u 1 •” 
t 上 ： “If vu’ = v f u = 1 ， set uv 4 - u , v , ^r- 1 and vu f <- v'u 4 - ()•” 

Either r~ v or t 上 is stipulated，for each uv. Exercise 465 proves that a perfect 
matching is in the sink component of any such family of endomorphisms if and 
only if it is fixed by all of them. Therefore we need only search for fixed points. 

For example，consider the problem of covering an mxn board with dominoes. 
This is the problem of finding a perfect matching on the grid graph P m nP n . The 
graph has ran vertices with m(n — 1 ) “horizontal” edges from (i ， j) to 

(i^j + 1) and (m — l)n “vertical” edges from (i^j) to (i + 1 ， 』•)• It has exactly 
(m — l)(n — 1) 4-cycles; and if we number the edges from left to right，no two 
4-cycles have the same largest edge. Therefore we can construct (m — l)(n — 1) 
endomorphisms，in each of which we’re free to decide whether to allow a partic¬ 
ular cycle to be filled by two horizontal dominoes or by two vertical ones. 

Let’s stipulate that hij and are allowed together only when i + j 

is odd; Vij and ^(j+r) are allowed together only when i + j is even. The nine 
endomorphisms when m = n = 4 are then 


(190) 


And it’s not difficult to see that only one 4x4 domino covering is fixed by all 
nine. Indeed (exercise 466)，the solution turns out to be unique for all m and n. 

The famous problem of the “mutilated chessboard” asks for a domino cov¬ 
ering when two opposite corner cells have been removed. This problem is 
unsatisfiable when m and n are both even, by exercise 7.1.4-213. But a SAT 
solver can’t discover this fact quickly from the clauses alone，because there are 
many ways to get quite close to a solution; see the discussion following 7.1.4- 

( 130 ). [S. Dantchev and S. Riis, in FOCS 42 (2001), 220-229, have proved in fact 

that every resolution refutation of these clauses requires 2 fi ( n ) steps.] 

When Algorithm C is presented with mutilated boards of sizes 6 x 6 , 8 x 8 , 
10 x 10, , 16 x 16, it needs respectively about 55K/x, 1.4M//, 31M//, 668 M/x, 

16.5 G//, and .91 Tfi (that’s teramems) to prove unsatisfiability. The even-odd 
endomorphisms typified by ( 190 ) come to our rescue, however: They narrow the 
search space spectacularly, reducing the respective running times to only 15 K //， 

60 K/x, 135 K/x, 250 K/x, 470 K/x, 690 K// (that’s kilomems). They even can verify 
the unsatisfiability of a mutilated 256 x 256 domino cover after fewer than 4.2 Gfi 
of calculation, exhibiting a growth rate of roughly 0 (n 3 ). 
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Endomorphisms can also speed up SAT solving in another important way: 

Theorem E. Let P 1 P 2 ...Pn be any permutation of {1 ， 2,… ， n}. Jf the Bool¬ 
ean function ... ^x n ) is satisfiable，then it has a solution such that 

x Pl x P2 .. • x Pn is lexicographically less than or equal to x' Vl x' V2 ... x' Vri for every 
endomorphism of f that takes x x x 2 • • • x n 4 x[x! 2 … x’ n . 


Proof. The lexicographically smallest solution of / has this property. | 

Maybe we shouldn’t call this a “theorem ”； it’s an obvious consequence of the 
fact that endomorphisms always map solutions into solutions. But it deserves to 
be remembered and placed on some sort of pedestal, because we will see that it 
has many useful applications. 

Theorem E is extremely good news，at least potentially, because every 
Boolean function has a huge number of endomorphisms. (See exercise 457.) 
On the other hand ， there’s a catch: We almost never know any of those endo¬ 
morphisms until after we’ve solved the problem! Still, whenever we do happen 
to know one of the zillions of nontrivial endomorphisms that exist ， we’re allowed 
to add clauses that narrow the search. There’s always a “lex-leader” solution 
that satisfies x 1 x 2 • •• x n < x\x' 2 … if there’s any solution at all. 

A second difficulty that takes some of the shine away from Theorem E is 
the fact that most endomorphisms are too complicated to express neatly as 
clauses. What we really want is an endomorphism that’s nice and simple，so 
that lexicographic ordering is equally simple. 

Fortunately，such endomorphisms are often available; in fact, they’re usually 
automorphisms —— symmetries of the problem —— defined by signed permutations 
of the variables. A signed permutation represents the operation of permuting 
variables and / or complementing them; for example, the signed permutation 
c 4132’ stands for the mapping (xi ， $ 2 , $ 3 ， $ 4 ) ($ 3 ， $i ， $ 3 , $ 5 )= ( 无 4 , $ 1 ， $ 3 , 无 2 ). 

This operation transforms the states in a much more regular way than ( 187 ): 




If a takes the literal u into we write ua = and in such cases a also takes u 
into v. Thus we always have ua = ua. We also write xa for the result of applying 
a to a sequence x of literals; for example, (xi^X 2 ^xs^X 4 )a = ( 无 4 ， $i ， $ 3 , 无 2 ). This 
mapping is a symmetry or automorphism of f(x) if and only if f(x) = f(xa) 
for all x. Exercises 474 and 475 discuss basic properties of such symmetries; see 
also exercise 7.2.1.2-20. 


Notice that a signed permutation can be regarded as an unsigned permuta¬ 
tion of the 2n literals {$ 1 ， r n ， xi ， x n }，and as such it can be written as 
a product of cycles. For instance，the symmetry 4132 corresponds to the cycles 
(142)(142)(3)(3). We can multiply signed permutations by multiplying these 
cycles in the normal way, just as in Section 1.3.3. 

The product or of two symmetries a and r is always a symmetry. Thus in 
particular, if a is any symmetry，so are its powers d 2 , cr 3 , etc. We say that a has 
order r if a, a 2 , • • ” a r are distinct and a r is the identity. A signed permutation 
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of order 1 or 2 is called a signed involution] this important special case arises if 
and only if a is its own inverse (a 2 = 1 ). 

It’s clearly easier to work with permutations of 2n literals than to work 
with permutations of 2 n states x\ .. .x n . The main advantage of a signed 
permutation a is that we can test whether or not a preserves the family F 
of clauses in a satisfiability problem. If it does, we can be sure that a also is an 
automorphism when it acts on all 2 n states. (See exercise 492.) 

Let’s go back to the example waerden(3^ 10; 97) that we’ve often discussed 
above. These clauses have an obvious symmetry，which takes xix ‘2 ... X 97 1 -^ 
XQrXge .. .x\. If we don’t break this symmetry, Algorithm C typically verifies 
unsatisfiability after about 530 Mfx of computation. Now Theorem E tells us 
that we can also assert that X\X 2 X^ < ^ 97 X 96 ^ 95 , say; but that symmetry-breaker 
doesn’t really help at all，because x\ has very little influence on Fortunately, 
however, Theorem E allows us to choose any permutation P 1 P 2 ••• Pn on which to 
base lexicographic comparisons. For example, we can assert that 0 : 48 ^ 47 X 46 ... < 
^ 50 ^ 51^52 - - - — provided that we don’t also require ^ 1 X 2^3 ... < ^ 97 ^ 96 ^ 95 … • 
(One fixed global ordering must be used, but the endomorphs can be arbitrary.) 

Even the simple assertion that < $ 50 , which is the clause ‘48 50’， cuts 
the running time down to about 410 M//，because this new clause combines nicely 
with the existing clauses 4648 50, 48 49 50, 48 50 52 to yield the helpful binary 
clauses 46 50, 49 50, 50 52. If we go further and assert that < 0 : 50 ^ 51 , the 

running time improves to 345 M//. And the next steps ^ 43 ^ 47^46 < ^ 50 ^ 51 ^ 52 ^ 
… ， a ： 48^47^46^45^44^43 < ^50^51^52^53^54^55 take us down to 290 M//，then 
260 M//，235 M//，220 M#; we’ve saved more than half of the running time by ex¬ 
ploiting a single reflection symmetry! Only 16 simple additional clauses，namely 

4850， 48 ai，50 ai，47 51 ai, 47 ai, 51 ai, 4652 … ， 4355 如 

are needed to get this speedup, using the efficient encoding of lex order in ( 169 ). 

Of course all good things come to an end，and we’ve now reached the point of 
diminishing returns: Further clauses to assert that ^ 43^47 ... X 42 < $ 50 $ 5 i … $56 
in the waerden(3, 10; 97) problem turn out to be counterproductive. 

A wonderful simplification occurs when a symmetry cr is a signed involu¬ 
tion that has comparatively few 2-cycles. Suppose, for example, that o — 
532416987; in cycle form this is (15)(15) (23) (23) (4) (4) ( 66 ) (79) (79) ( 88 ). Then the 

lexicographic relation x = x 1 ... x 9 < x[ .. .x f 9 = xo holds if and only if X\X 2 X^ < 
x^x^x^. The reason is clear，once we look closer (see F. A. Aloul ， A. Ramani，L L. 
Markov, and K. A. Sakallah, IEEE Trans. CAD-22 (2003), 1117-1137, §III.C): 

The relation x 1 .. .x 9 < x[ ... means，in this case ， u x\ < X 5 ； if x\ — then 
x ‘2 < X 3 ； if xi = and X 2 — xs then xs < ^ 2 ； if x\ — X 2 = 无 3 , and xs = X 2 
then X 4 < X 4 ； if x\ = X 2 = 龙 3 ，冗 3 = 龙 2 ， and X 4 = X 4 then < xi ； if 
x\ = X 5 ) x ‘2 = X 3 ^ X 3 = X 2 ^ X 4 = $ 4 ， and = x\ then xq < xq] if x\ = X 5 , 
X 2 = 龙 3， $3 = 无 2， $4 = $ 4 , $5 = ， and xq = xq then we’re done for •” With 

this expanded description the simplifications are obvious. 

In general this reasoning allows us to improve Theorem E as follows: 
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Corollary E. Let P 1 P 2 … p n be any permutation of {1 ， 2,… ， n}. For every 

signed involution o that is a symmetry of clauses F，we can write a in cycle 

form _ _ 

(Ph 干办 ） ( 仍 2 士 Pj2XPi2 干 Pj2 ) (Pit 士 PjtXPit TPj t ) (l 92 ) 

with ii < ji, i 2 < j 2 ， ••• ， i t < j t ，h < h < ••• < it，and with (p ik ^fp jk ) 
omitted when = jk; and we^re allowed to append clauses to F that assert the 
lexicographic relation x Pii x Pi2 … x Piq < $ 士 Pjl $ 士巧 2 •…$ 士巧 9 ， where q = t or q is 
the smallest k with = jV I 

In the common case when g is an ordinary signless involution，all of the signs 
can be eliminated here; we simply assert that x Pii • •. x Pif < x Pji • • • x Pjt . 

This involution principle justifies all of the symmetry-breaking techniques 
that we used above in the pigeonhole and quad-free matrix problems. See, for 
example, the details discussed in exercise 495. 

The idea of breaking symmetry by appending clauses was pioneered by J.-F. 

Puget [LNCS 689 (1993), 350-361], then by J. Crawford, M. Ginsberg, E. Luks, 
and A. Roy [Int. Conf. Knowledge Representation and Reasoning 5 (1998) ， 148 — 
159], who considered unsigned permutations only. They also attempted to dis¬ 
cover symmetries algorithmically from the clauses that were given as input. Ex¬ 
perience has shown ， however，that useful symmetries can almost always be better 
supplied by a person who understands the structure of the underlying problem. 

Indeed，symmetries are often “semantic” rather than “syntactic •” That is ， 
they are symmetries of the underlying Boolean function, but not of the clauses 
themselves. In the Zarankiewicz problem about quad-free matrices, for example ， 
we appended efficient cardinality clauses to ensure that ^ Xij > r; that condition 
is symmetric under row and column swaps, but the clauses are not. 

In this connection it may also be helpful to mention the monkey wrench prin¬ 
ciple: All of the techniques by which weVe proved quickly that the pigeonhole 
clauses are unsatisfiable would have been useless if there had been one more 
clause such as (xoi V x\\ V 无 22 ); that clause would have destroyed the symmetry! 

We conclude that we’re allowed to remove clauses from F until reaching a 
subset of clauses Fq for which symmetry-breakers S can be added. If F = F 0 UFi, 
and if Fq is satisfiable F 0 \J S is satisfiable，then F 0 U 5 h e F h e. 

One hundred test cases. And now — ta da! — let’s get to the climax of this 
long story，by looking at how our SAT solvers perform when presented with 100 
moderately challenging instances of the satisfiability problem. The 100 sets of 
clauses summarized on the next two pages come from a wide variety of different 
applications, many of which were discussed near the beginning of this section ， 
while others appear in the exercises below. 

Every test case has a code name, consisting of a letter and a digit. Table 6 
characterizes each problem and also shows exactly how many variables ， clauses ， 
and total literals are involved. For example, the description of problem A1 
ends with < 20431 24772 15519511 /; this means that A1 consists of 24772 clauses on 2043 
variables, having 55195 literals altogether, and those clauses are unsatisfiable. 
Furthermore, since ‘24772’ is underlined, all of ATs clauses have length 3 or less. 
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Table 6 

CAPSULE SUMMARIES OF THE HUNDRED TEST CASES 


Al. Find x = X 1 X 2 . - - xqq with vx — 27 and 
no three equally spaced Is. (See exercise 31.) 

2043|24772|55195|U 

A2. Like Al, but X\X 2 - - - ^ioo- 

2071I25197I56147IS 

Bl. Cover a mutilated 10 X 10 board with 
49 dominoes, without using extra clauses 

to break symmetry. i76|572|i300|u 

B2. Like Bl, but a 12 X 12 board with 
71 dominoes. 260|856|i948|u 

Cl- Find an 8-step Boolean chain that 
computes ( 22 ^ 1 ^ 0)2 — x\ -\- X 2 + $3 + ^4- 
(See exercise 479(a).) 384|i6944|66336|u 

C2. Find a 7-step Boolean chain that 
computes the modified full adder functions 
之 1 ， 2 ： 2, Z 3 in exercise 481(b). 469|26637|iooo63|u 

C3- Like C2, but with 8 steps. 

572|33675|134868|S 

C4. Find a 9 - step Boolean chain that 
computes z\ and z r in the mod-3 addition 

problem of exercise 480(b). 678|45098|i83834|s 

C5 - Connect A to A, …， J to J in Dudeney’s 
puzzle of exercise 392 ， (iv). i980|225i8|70356|s 

C6 - Like C5, but move the J in row 8 from 

column 4 to column 5. 1980|22518|70356|U 

C7- Given binary strings Si , …， 550 of 
length 200, randomly generated at distances 
< rj from some string find x (see 
exercise 502). 65719157736811659623 is 

C8- Given binary strings si, …， 540 of 
length 500, inspired by biological data, find 
a string at distance < 42 from each of them. 

1235401 909120 I 2569360IU 

C9 - Like C8, but at distance < 43. 

1241001926200 12620160 |S 

Dl. Satisfy factor-fifo(18, W, 111111111111). 
(See exercise 41.) i94p|6374ii6498iu 

D2. Like Dl, but factorJifo. i94p|6374ii6498iu 

D3. Like Dl，but (19, 19, 111111111111). 

2052167451 17461|S 

D4. Like D2, but (19, 19, 111111111111). 

2052I6745I1746HS 

D5. Solve (x! ... x 9 ) 2 x (y! … 2 / 9)2 7 ^ 

... ^ 9)2 X’ （ 2/1 … 2/9)2, with two copies 
of the same Dadda multiplication circuit. 

864|2791|7236|U 

E0- Find an Erdos discrepancy pattern 
xi ... X 500 (see exercise 482). 1603191571274691 S 

El* Like E0, but x\ ... 2 : 750 - 

2556|14949|44845|S 

E2* Like E0, but x\ ... ^ 1000 - 

3546|21035|63103|S 

FI. Satisfy fsnark(99). (See exercise 176.) 

1782|4161|8913|U 


F2* Like FI, but without the clauses 

(ei,3 V /99,3) 八 (/l,l V 62 ,l)- 1782|4159|8909|S 

Gl- Win Late Binding Solitaire with the 
“most difficult winnable deal” in answer 486. 

1242|22617|65593|S 

G2* Like Gl, but with the most difficult 
unwinnable deal. 12421 226i2|65588|u 

G3- Find a test pattern for the fault u Kl 
stuck at 0” in prod (16, 32). 34981113371290971 s 

G4* Like G3, but for the fault U D^ 9 stuck 

at 0.” 3502I11349I29127IS 

G5 - Find a 7 X 15 array Xo leading to 
X 3 = LIFE as in Fig. 35, having at most 38 

live cells. 7150|28508|71873|U 

G6. Like G5, but at most 39 live cells- 

7152I28536I71956IS 

G7- Like G5, but X 4 = LIFE and Xq can 

be arbitrary* 8725 13376918404i|u 

G8- Find a configuration in the Game 
of Life that proves /*(7,7) = 28 (see 
exercise 83) - 97909| 40i836 i 10201741 s 

K0- Color the 8 x 8 queen graph with 8 
colors, using the direct encoding ( 15 ) and 

( 16 ) , also forcing the colors of all vertices 

in the top rOW. 512|5896|12168|U 

Kl. Like K0, but with the exclusion clauses 

( 17 ) also. 512|7688|15752|U 

K2* Like Kl, but with kernel clauses instead 
of ( 17 ) (see answer 14). 5i2|6408|24328|u 

K3- Like Kl, but with support clauses 
instead of ( 16 ) (see exercise 399)- 

512|13512|97288|U 

K4. Like Kl, but using the order encoding 

for Colors. 448|6215|21159|U 

K5 - Like K4, but with the hint clauses 
( 162 ) appended. 448 1 6299 |2i663|u 

K6 - Like K5, but with double clique hints 
(exercise 396). 896 185591279271 u 

K7. Like Kl, but with the log encoding 
of exercise 391(a). 237615120115312 m 

K8- Like Kl, but with the log encoding 

192|5848|34968|U 
130|2437|5204|U 
273|1020|2370|U 
228|5875|12356|U 
502| 1857|4320|U 
1472|102922|210068|S 
3512|12768|29760|S 
6016|869650|1756964|S 
14704|53184|124032|S 


of exercise 391(b). 

LI* Satisfy langford(10). 

L2* Satisfy langford f (10). 

L3- Satisfy langford (13). 

L4. Satisfy langford f (13). 

L5. Satisfy langford (32)• 

L6. Satisfy langford f (32). 

L7. Satisfy lang ford (64). 

L8- Satisfy lang ford’ （ 64). 

Ml* Color the McGregor graph of order 
10 (Fig. 33) with 4 colors, using one color at 
most 6 times, via the cardinality constraints 

(l 8 ) and ( 19 ). 1064|2752|6244| 
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M2. Like Ml, but via ( 20 ) and ( 21 ). 

814|2502|5744|U 

M3- Like Ml, but at most 7 times. 

1161|2944|6726|S 

M4. Like M2, but at most 7 times- 

864|2647|6226|S 

M5 - Like M4, but order 16 and at most 

11 times. 2256|7801|18756|U 

M6. Like M5, but at most 12 times. 

2288|8080|19564|S 

MT* Color the McGregor graph of order 9 
with 4 colors, and with at least 18 regions 
doubly colored (see exercise 19)- 

952|4539|13875|S 

M8- Like M7, but with at least 19 regions. 

952|4540|13877|U 

Nl. Place 100 nonattacking queens on 
a 100 X 100 board. 1000011151800123134001 s 

Ol- Solve a random open shop scheduling 
problem with 8 machines and 8 jobs, in 
1058 units of time. 508461 55782311621693 m 

02. Like Ol, but in 1059 units. 

509011558534 11623771 IS 

PO. Satisfy ( 99 ), (ioo), and ( 101 ) for 
m = 20, thereby exhibiting a poset of size 
20 with no maximal element- 400|7260|22080|u 

PI- Like P0, but with m = 14 and using 
only the clauses of exercise 228. i96|847|2667|u 

P2. Like P0, but with m = 12 and using 
only the clauses of exercise 229. i44|530|i674|u 

P3- Like P2, but omitting the clause 

( 无 31 V 无 16 V X3q). 144|529|1671|S 

P4. Like P3, but with m = 20. 400|2509|7827|s 
Q0- Like K0, but with 9 colors- 

576|6624|13688|S 

Ql- Like Kl, but with 9 colors. 

576|8928|18296|S 

Q2. Like K2, but with 9 colors- 

576|7200|27368|S 

Q3- Like K3, but with 9 colors. 

576|15480|123128|S 

Q4. Like K4, but with 9 colors. 

512|7008|24200|S 

Q5 - Like K5, but with 9 colors. 

512|7092|24704|S 

Q6- Like K6, but with 9 colors. 

1024|9672|31864|S 

Q7- Like K7, but with 9 colors. 

3168|6776|20800|S 

Q8. Like K8, but with 9 colors. 

256|6776|52832|S 

Q9 - Like Q8, but with the log encoding 
of exercise 391(c). 256|6584|42256|S 

Rl. Satisfy rand (3, 1061 ， 250, 314159). 

250|1061|3183|S 

R2. Satisfy rand (3, 1062, 250, 314159)• 

250I1062I3186IU 


SI- Find a 4-term disjunctive normal form 
on {：ri ” • • ， X 20 } that differs from ( 27 ) but 
agrees with it at 108 random training points. 

356|4229|16596|S 

S2. Like Sl，but at 109 points. 

360|4310|16760|U 

S3- Find a sorting network on nine 
elements that begins with the comparators 
[1:6] [2:7] [3:8] [4:9] and finishes in five more 
parallel rounds. (See exercise 64.) 

5175|85768|255421|U 

S4. Like S3, but in six more rounds. 

6444|107800|326164|S 

Tl. Find a 24 x 100 t at ami tiling that spells 
‘TATAMI’ as in exercise 118. 2874|io527|26ii2|s 

T2. Like Tl, but 24 x 106 and the C V should 

have Serifs. 3048|11177|27724|U 

T3- Solve the TAOCP problem of exercise 
389 with only 4 knight moves. 

3752|12069|27548|U 

T4. Like T3, but with 5 knight moves. 

3756|12086|27598|S 

T5 - Find the pixel in row 5, column 
18 of Fig. 37(c), the lexicographically last 
solution to the Cheshire Tom problem. 

8837I39954I100314IS 

T 6 . Like T5, but column 19. 

8837I39955I100315IU 

T7- Solve the run-count extension of the 
Cheshire Tom problem (see exercise 117). 

25734|65670|167263|S 

T 8 - Like T7, but find a solution that differs 
from Fig. 36. 25734|65671|167749|U 

Wl. Satisfy waerden(3^10; 97). 

97|2779|11662|U 

W2. Satisfy waerden(3^13] 159). 

159|7216|31398|S 

W3. Satisfy waerden (5^5] 177). 

177|7656|38280|S 

"W4. Satisfy waerden (5^5; 178). 

178|7744|38720|U 

XI• Prove that the “taking turns” 
protocol ( 43 ) gives mutual exclusion for 
at least 100 steps. ioio|36i2|io6i4|u 

X2. Prove that assertions <E> for the four-bit 
protocol of exercise 101 , analogous to ( 50 )， 
are invariant. 12913541 926|u 

X3 - Prove that Bob won’t starve in 36 steps, 
assuming the 少 of X2. i652|io552|2897i|u 

X4* Prove that there’s a simple 36-step 
path with the four-bit protocol，assuming 
the 少 of X2. 22199|50264| 130404|S 

X5 - Like X4, but 37 steps. 23388|52822|137034|U 

X 6 - Like XI, but with Peterson’s proto¬ 
col ( 49 ) instead of ( 43 ). 2218 I 80201232221 U 

X7 - Prove that there’s a simple 54-step 
path with protocol ( 49 ). 2645015631211475721 s 

X 8 - Like X7, but 55 steps. 

27407|58317|152807|U 
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Of course we can’t distinguish hard problems from easy ones by simply 
counting variables, clauses, and literals. The great versatility with which clauses 
can capture logical relationships means that different sets of clauses can lead to 
wildly different phenomena. Some of this immense variety is indicated in Fig. 52, 
which depicts ten instructive “variable interaction graphs •” Each variable is 
represented by a ball，and two variables are linked when they appear together 
in at least one clause. (Some edges are darker than others; see exercise 506. For 
further examples of such 3D visualizations，presented also in color，see Cars ten 

Sinz, Journal of Automated Reasoning 39 (2007), 219-243.) 

A single SAT solver cannot be expected to excel on all of the many species of 
problems. Furthermore, nearly all of the 100 instances in Table 6 are well beyond 
the capabilities of the simple algorithms that we began with: Algorithms A ， B ， 
and D are unable to crack any of those test cases without needing more than 
fifty gigamems of computation, except for the simplest examples — Ll ， L2, L5, 
P3 ， P4, and X2. Algorithm L，the souped-up refinement of Algorithm D，also 
has a lot of difficulty with most of them. On the other hand，Algorithm C does 
remarkably well. It polishes off 79 of the given problems in fewer than ten Gfi. 

Thus the test cases of Table 6 are tough, yet they’re within reach. Almost all 
of them can be solved in say two minutes，at most，with methods known today. 

Complete details can be found in the file SATexamples • tgz on the author’s 
website, together with many related problems both large and small. 

Exactly 50 of these 100 cases are satisfiable. So we’re naturally led to wonder 
whether Algorithm W (“WalksAT ”） will handle such cases well. The answer is 
that Algorithm W sometimes succeeds brilliantly — especially on problems C7, 
C9, L5, L7, M3, M4, M6, P3, P4, Q0, Ql, Rl, SI, where it typically outperforms 

all the other methods we’ve discussed. In particular it solved SI in just 1 M//，in 
the author’s tests，compared to 25 Mfi by the next best method, Algorithm C; it 
won by 15 Mfi versus Algorithm C’s 83 M/x on M3, by 83 Mfi versus Algorithm L’s 
104 Mfi on Q0, by 95 Mfi versus Algorithm C’s 464 Mfi on Ql，and by a whopping 
104 M/x versus Algorithm C’s 7036 M// on C7. That was a surprise. WalkSAT 
also was reasonably competitive on problem Nl. But in all other cases it was 
nowhere near the method of choice. Therefore we’ll consider only Algorithms L 
and C in the remainder of this discussion.* 

When does a lookahead algorithm like Algorithm L outperform a clause¬ 
learning algorithm like Algorithm C? Figure 53 shows how they compare to 
each other on our 100 test cases: Each problem is plotted with Algorithm C’s 
running time on the vertical axis and Algorithm L’s on the horizontal axis. 
Thus Algorithm L is the winner for problems that appear above the dotted line. 
(This dotted line is “wavy” because times aren’t drawn to scale: The A:th fastest 
running time is shown as k units from the left of the page or from the bottom.) 

* There actually are two variants of Algorithm L, because the alternative heuristics of 
exercise 143 must be used for looking ahead when clauses of length 4 or more are present- We 
could use exercise 143 even when given all-ternary clauses; but experience shows that we’d tend 
to lose a factor of 2 or more by doing so- Our references to Algorithm L therefore implicitly 
assume that exercise 143 is being applied only when necessary. 


Sinz 

SATexamples•tgz 

Knuth 

website 

Knuth 

lookahead algorithm 
clause-learning algorithm 
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All of these experiments were aborted after 50 G//，if necessary，since many 
of these problems could potentially take centuries before running to completion. 
Thus the test cases for which Algorithm L timed out appear at the right edge of 
Fig. 53, and the tough cases for Algorithm C appear at the top. Only E2 and X8 
were too hard for both algorithms to handle within the specified cutoff time. 

Algorithm L is deterministic: It uses no random variables. However, a slight 
change (see exercise 505) will randomize it, because the inputs can be shuffled 
as they are in Algorithm C; and we might as well assume that this change has 
been made. Then both Algorithms L and C have variable running times. They 
will find solutions or prove unsatisfiability more quickly on some runs than on 
others, as we’ve already seen for Algorithm C in Fig. 49. 

To compensate for this variability, each of the runtimes reported in Fig. 53 is 
the median of nine independent trials. Figure 54 shows all 9 x 100 of the empirical 
running times obtained with Algorithm C，sorted by their median values. We 
can see that many of the problems have near-constant behavior; indeed，the ratio 
max/min was less than 2 in 38 of the cases. But 10 cases turned out to be highly 
erratic in these experiments, with max/min > 100; problem P4 was actually 
solved once after only 323 kilomems, while another run lasted 339 ^amems! 

One might expect satisfiable problems, such as P4，to benefit more from 
lucky guesses than unsatisfiable problems do; and these experiments strongly 
support that hypothesis: Of the 21 problems with max/min > 30， all but P0 
are satisfiable, and all 32 of the problems with max/min < 1.7 are unsatisfiable. 
One might also expect the mean running time (the arithmetic average) to exceed 
the median running time，in problems like this — because bad luck can be signifi¬ 
cantly bad, though hopefully rare. Yet the mean is actually smaller than the me¬ 
dian in 30 cases, about equally distributed between satisfiable and unsatisfiable. 

The median is a nice measure because it is meaningful even in the presence 
of occasional timeouts. It’s also fair，because we are able to achieve the median 
time, or better, more often than not. 

We should point out that input/output has been excluded from these time 
comparisons. Each satisfiability problem is supposed to appear within a com¬ 
puter^ memory as a simple list of clauses，after which the counting of mems 
actually begins. We include the cost of initializing the data structures and solving 
the problem，but then we stop counting before actually outputting a solution. 

Some of the test cases in Table 6 and Fig. 53 represent different encodings 
of the same problem. For example, problems K0-K8 all demonstrate that the 
8x8 queen graph can’t be colored with 8 colors. Similarly, problems Q0-Q9 all 
show that 9 colors will suffice. We’ve already discussed these examples above 
when considering alternative encodings; and we noted that the best solutions ， 
K6 and Q5, are obtained with an extended order encoding and with Algorithm C. 
Therefore the fact that Algorithm L beats Algorithm C on problems K0 ， Kl ， 
K2, and K3 is somewhat irrelevant; those problems won’t occur in practice. 

Problems L5 and L6 compare different ways to handle the at-most-one 
constraint. L6 is slightly better for Algorithm L，but Algorithm C prefers L5. 
Similarly，Ml and M2 compare different ways to deal with a more general 


deterministic 
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Fig. 54. Nine random running times of Algorithm C，sorted by their medians. 
(Unsatisfiable cases have solid dots or squares; satisfiable cases are hollow.) 


cardinality constraint. Here M2 turns out to be better，although both are quite 
easy for Algorithm C and difficult for Algorithm L. 

We’ve already noted that Algorithm L shines with respect to random prob¬ 
lems such as R1 and R2，and it dominates all competitors even more when 
unsatisfiable random 3SAT problems get even bigger. Lookahead methods are 
also successful in waerden problems like W1-W4. 

Unsatisfiable Langford problems such as L3 and L4 are definitely betes noires 
for Algorithm C，although not so bad for Algorithm L. Even the world’s fastest 
CDCL solver ， “Treengeling，” was unable to refute the clauses of langford (17) 
in 2013 until it had learned 26.7 billion clauses; this process took more than a 
week，using a cluster of 24 computers working together. By contrast, the dancing 
links method of Section 7.2.2.1 was able to prove unsatisfiability after fewer than 
7.2 T// of computation — that’s about 90 minutes on a single vintage-2013 CPU. 

We’ve now discussed every case where Algorithm L trumps Algorithm C ， 
except for D5; and D5 is actually somewhat scandalous! It’s an inherently simple 
problem that hardware designers call a “miter” ： Imagine two identical circuits 
that compute some function f(x \ ， • • • ， x n )，one with gates gi ， … ， g m and another 
with corresponding gates g[^ ..., all represented as in ( 24 ). The problem is 
to find x\ .. .x n for which the final results g m and g' m aren’t equal. It’s obviously 
unsatisfiable. Furthermore ， there’s an obvious way to refute it，by successively 
learning the clauses (^V^), ( 甙 V 仍）， ( 如 Vg' 2 ) ， ( 远 V 夕 2 ) ， etc. In theory, therefore, 
Algorithm C will almost surely finish in polynomial time (see exercise 386). 
But in practice, the algorithm won’t discover those clauses without quite a lot 
of flailing around，unless special-purpose techniques are introduced to help it 
discover isomorphic gates. 
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Thus Algorithm C does have an Achilles heel or two. On the other hand, it empirical performance measurements++ 

is the clear method of choice in the vast majority of our test cases，and we can 
expect it to be the major workhorse for most of the satisfiability problems that 
we encounter in daily work. Therefore it behooves us to understand its behavior 
in some detail，not just to look at its total cost as measured in mems. 

Table 7 

ALGORITHM C，S EMPIRICAL BEHAVIOR ON THE HUNDRED TEST CASES 
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1+31898 M/i 
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0+53739 M/x 
2+248789 M/x 
0+295571 M/x 
0+677815 Mu 
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Table 7 summarizes the salient statistics，again listing all cases in order of 
their median running time (exclusive of input and output). Each running time 
is actually broken into two parts, L x+y\ where x is the time to initialize the 
data structures in step Cl and y is the time for the other steps，both rounded 
to megamems. For example, the exact median processing time for case L5 
was 1,484,489// to initialize，then 655,728/i to find a solution; this is shown 
as in the third line of the table. The time for initialization is usually 

negligible except when there are many clauses, as in problem Nl. 
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The median run of problem L5 also allocated 1 ， 841，372 bytes of memory for 
data; this total includes the space needed for 210,361 cells in the MEM array, at 
4 bytes per cell，together with other arrays such as VAL ， OVAL, HEAP, etc. The 
implementation considered here keeps unlearned binary clauses in a separate 
BIMP table, as explained in the answer to exercise 267. 

This run of L5 found a solution after implicitly traversing a search tree with 
138 “nodes •” The number of nodes, or “decisions，” is the number of times step C 6 
of the algorithm goes to step C3. It is shown as C 0K’ in Table 7， because the 
node counts, byte counts, and cell counts are rounded to the nearest thousand. 

The number of nodes always exceeds or equals the number of learned clauses ， 
which is the number of conflicts detected at levels d > 0. (See step C7.) In the 
case of problem L5, only 84 clauses were learned; so again the table reports c 0 K\ 
These 84 clauses had average length r + 1 = 146.1; then the simplification process 
of exercise 257 reduced this average to just 38.4. Nevertheless, the resulting 
simplified clauses were still sufficiently long that the “trivial” clauses discussed 
in exercise 269 were sometimes used instead; this substitution happened 43 
times (51 %). Furthermore 19 of the learned clauses (23%) were immediately 
discarded, using the method of exercise 271. These percentages show up in the 
‘triv’ and ‘disc’ columns of the table. 

Sometimes, as in problems Dl—D5, a large majority of the learned clauses 
were replaced by trivial ones; on the other hand, 27 of the 100 cases turned out 
to be less than 10% trivial in this sense. Table 7 also shows that the discard 
rate was 5 % or more in 26 cases. The c sub’ column refers to learned clauses that 
were “subsumed on the fly” by the technique of exercise 270; this optimization 
is less common，yet it occurs often enough to be worthwhile. 

The great variety in our examples is reflected in the variety of behaviors ex¬ 
hibited in Table 7, although several interesting trends can also be perceived. For 
example，the number of nodes is naturally correlated with the number of learned 
clauses, and both statistics tend to grow as the total running time increases. But 
there are significant exceptions: Two outliers ，01 and 02 ， have a remarkably 
high ratio of mems per learned clause, because of their voluminous data. 

The penultimate column of Table 7 counts how often Algorithm C decided 
to restart itself after flushing unproductive literals from its current trail. This 
quantity does not simply represent the number of times step C5 discovers that 
M > Mf ； it depends also on the current agility level (see ( 127 )) and on the 
parameter ^ in Table 4. Some problems, like A1 and A2, had such high agility 
that they were solved satisfactorily with no restarts whatsoever; but another 
one ， T4, finished in about 500 megamems after restarting more than 3000 times. 

The number of “purges” (recycling phases) is not shown, but it can be 
estimated from the number of learned clauses (see exercise 508). An aggressive 
purging policy has kept the total number of memory cells comfortably small. 

Tuning up the parameters. Table 7 shows that the hardest problem of all for 
Algorithm C in these experiments, L4, found itself substituting trivial clauses 
86 % of the time but making only 5 restarts. That test case would probably have 
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been solved much more quickly if the algorithm’s parameters had been specially 
adjusted for instances of the Langford problem. 

Algorithm C，as implemented in the experiments above，has ten major 
parameters that can be modified by the user on each run: 

a, tradeoff between p and q in clause RANGE scores (see Eq. ( 123 )); 
p，damping factor in variable ACT scores (see after ( 118 )); 
q, damping factor in clause ACT scores (see Eq. ( 125 )); 

△ p ，initial value of the purging threshold M p (see after ( 125 )); 

J p ，amount of gradual increase in M p (see after ( 125 )); 
t ， threshold used to prefer trivial clauses (see answer to exercise 269); 
w, full “warmup” runs done after a restart (see answer to exercise 287); 
p, probability of choosing a decision variable at random (see exercise 266); 
P, probability that OVAL (AO is initially even; 

也 agility threshold for flushing (see Table 4). 

The values for these parameters initially came from seat-of-the-pants guesses 

a = 0.2, p = 0.95, g = 0.999, A p - 20000, 心 = 500, 

t = 1 ， w; = 0 ， p = 0.02, P = 0, -0 — 0.166667; ( 193 ) 

and these defaults gave reasonably good results, so they were used happily for 
many months (although there was no good reason to believe that they couldn’t be 
improved). Then finally, after the author had assembled the set of 100 test cases 
in Table 6 , it was time to decide whether to recommend the default values ( 193 ) 
or to come up with a better set of numbers. 

Parameter optimization for general broad-spectrum use is a daunting task ， 
not only because of significant differences between species of SAT instances but 
also because of the variability due to random choices when solving any specific 
instance. It’s hard to know whether a change of parameter will be beneficial or 
harmful，when running times are so highly erratic. Ouch — Fig. 54 illustrates 
dramatic variations even when all ten parameters are held fixed，and only the 
seed for random numbers is changed! Furthermore the ten parameters are not 
at all independent: An increase in p ， say, might be a good thing, but only if the 
other nine parameters are also modified appropriately. How then could any set 
of defaults be recommended, without an enormous expense of time and money? 

Fortunately there’s a way out of this dilemma，thanks to advances in the 
theory of learning. F. Hutter, H. H. Hoos ， K. Leyton-Brown, and T. Stiitzle have 
developed a tool called ParamILS intended specifically for making such tuneups 
[J. Artificial Intelligence Research 36 (2009), 267-306]; the ‘ILS’ in this name 
stands for “iterated local search •” The basic idea is to start with a representative 
training set of not-too-hard problems, and to carry out random walks in the 10 - 
dimensional parameter space using sophisticated refinements of WalkSAT-like 
principles. The best parameters discovered during this training session are then 
evaluated on more difficult problems outside the training set. 
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In March 2015, Holger Hoos helped the author to tune Algorithm C using 
ParamILS. The resulting parameters then yielded Fig. 54, and Table 7, and many 
other runtime values discussed above and below. Our training set consisted 
of 17 problems that usually cost less than 200 M// with the original parame¬ 
ters ( 193 ), namely {K5,K 6 ,M2, M4,Nl, SI,S4,X4,X 6 } together with stripped- 
down versions of {A1,C2,C3,D1,D2,D3,D4 ， K0}. For example, instead of the 
vector xi ... x\oo required by problem Al, we looked only for a shorter vector 
x — x\ ... : T 62 , now with vx — 20; instead of D1 and D2 we sought 13-bit factors 
of 31415926; instead of K0 we tried to 9-color the SGB graph jean. 

Ten independent training runs with ParamILS gave ten potential parameter 
settings (ai ， … ，如 ). We evaluated them on our original 17 benchmarks ， 
together with 25 others that were a bit more difficult: {FI, F2, S2, S3, T4, X5}, 
plus less-stripped-down variants of {Al, A2, A2, C7, C7, D3, D4, FI, F2, Gl, Gl, 
G2, G2, G 8 , K0, 01, 02, QO, Q2}. For each of the ten shortlisted parameter set¬ 
tings, we ran each of these 17 + 25 problems with each of the random seeds 
{1 ， 2,… ， 25}. Finally, hurray, we had a winner: The parameters (o^ ， … ，也） 
with minimum total running time in this experiment were 

a = 0.4, p 二 0.9, g = 0.9995, A p = 1000 ，心 = 500, 

t = 10 ， w = 0, p — 0.02, P = 0.5, 4 = 0.05. ( 194 ) 

And these are now the recommended defaults for general-purpose use. 

How much have we thereby gained? Figure 55 compares the running times of 
our 100 examples, before and after tuning. It shows that the vast majority — 77 
of them — now run faster; these are the cases to the right of the dotted line from 
(1 M/x, 1M//) to (1 T/n, 1 Tfi). Half of the cases experience a speedup exceeding 
1.455; 27 of them now run more than twice as fast as they previously did. 

Of course every rule has exceptions. The behavior of case P4 has gotten 
spectacularly worse，almost three orders of magnitude slower! Indeed，we saw 
earlier in Fig. 54 that this case has an amazingly unstable running time; further 
peculiarities of P4 are discussed in exercise 511. 

Our other major SAT solver，Algorithm L，also has parameters，notably 

a，magic tradeoff coefficient in heuristic scores (see Eq. ( 64 )); 

3, damping factor for double - look triggering (see step Yl); 

7 , clause weight per literal in heuristic scores (see exercise 175); 
s, offset in heuristic scores (see answer to exercise 146); 

©，maximum heuristic score threshold (see answer to exercise 145); 

Y"，maximum depth of double-lookahead (see step Yl). 

ParamILS suggests the following default values, which have been used in Fig. 53: 

a = 3.5, /? = 0.9998, 7 = 0.2, e = 0.001, © = 20.0, Y = 1. ( 195 ) 

Returning to Fig. 55, notice that the change from ( 193 ) to ( 194 ) has substan¬ 
tially hindered cases G3 and G4, which are examples of test pattern generation. 
Evidently such clauses have special characteristics that make them prefer special 
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Fig. 55. Median running times 
of Algorithm C, before and after 
its parameters were tuned. 
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settings of the parameters. Our main reason for introducing parameters in the 
first place was，of course, to allow tweaking for different families of clauses. 

Instead of finding values of ( 0 ^ p， … » that give good results in a broad 
spectrum of applications, we can clearly use a system like ParamILS to find 
values that are specifically tailored to a particular class of problems. In fact ， 
this task is easier. For example，Hoos and the author asked for settings of the 
ten parameters that will tend to make Algorithm C do its best on problems of 
the form waerden(3^ k] n). A pair of ParamILS runs, based solely on the easy 
training cases waerden(3^9; 77) and waerden(3, 10; 95)，suggested the parameters 

a = 0.5, p = 0.9995, g = 0.99, A p - 100, 心 =10, 

r 二 10 ， w = 8, p = 0.01, P = 0.5 ， 4 二 0.15 ， (ig 6 ) 

and this set indeed works very well. Figure 56 shows typical details, with 7 < 
k < 14 and with nine independent sample runs for every choice of k and n. 
Each unsatisfiable instance has n = W(3^ k)^ as given in the table following ( 10 ) 
above; each satisfiable instance has n = W(3^ k) — l. The fastest run using default 
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parameters (194) has been paired in Fig. 56 with the fastest run using waerden- 
tuned parameters (196); similarly，the second-fastest, ••• ， second-slowest, and 
slowest runs have also been paired. Notice that satisfiable instances tend to take 
an unpredictable amount of time，as in Fig. 54. In spite of the fact that the new 
parameters (196) were found by a careful study of just two simple instances，they 
clearly yield substantial savings when applied to much，much harder problems 
of a similar nature. (See exercise 512 for another instructive example.) 

Exploiting parallelism. Our focus in the present book is almost entirely on 
sequential algorithms, but we should be aware that the really tough instances of 
SAT are best solved by parallel methods. 

Problems that are amenable to backtracking can readily be decomposed into 
subproblems that partition the space of solutions. For example, if we have 16 
processors available, we can start them off on independent SAT instances in which 
variables x\X2X^x^ have been forced to equal 0000, 0001 ， … ， 1111. 

A naive decomposition of that kind is rarely the best strategy, however. 
Perhaps only one of those sixteen cases is really challenging. Perhaps some of 
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the processors are slower than others. Perhaps several processors will learn new 
clauses that the other processors ought to know. Furthermore, the splitting into 
subproblems need not occur only at the root of the search tree. Careful load¬ 
balancing and sharing of information will do much better. These challenges were 
addressed by a pioneering system called PSATO [H. Zhang, M. P. Bonacina, and 
J. Hsiang, Journal of Symbolic Computation 21 (1996), 543-560]. 

A much simpler approach should also be mentioned: We can start up many 
different solvers, or many copies of the same solver, with different sources of 
random numbers. As soon as one has finished, we can then terminate the others. 

The best parallelized SAT solvers currently available are based on the “cube 
and conquer” paradigm, which combines conflict-driven clause learning with 
lookahead techniques that choose branch variables for partitioning; see M. J. H. 

Heule, O. Kullmann, S. Wieringa, and A. Biere, LNCS 7261 (2012), 50-65. In 

particular, this approach is excellent for the waerden problems. 

Today has proved to be an epoch in my Logical work. 

...I think of calling it the 'Genealogical Method.' 

— CHARLES L. DODGSON, Diary (16 July 1894) 

The method of showing a statement to be tautologous 
consists merely of constructing a table under it in the usual way 
and observing that the column under the main connective 

is composed entirely of ‘T’s. 

— W. V. O. QUINE, Mathematical Logic (1940) 

A brief history. The classic syllogism “All men are mortal; Socrates is a man; 
hence Socrates is mortal” shows that the notion of resolution is quite ancient: 

-iMan V Mortal; -nSocrates V Man; :• -iSocrates V Mortal. 

Of course，algebraic demonstrations that {^x\/y) /\{-^z\/x) implieswhen 
X ， y, and 2 ： are arbitrary Boolean expressions, had to wait until Boole and his 
19th-century followers brought mathematics to bear on the subject. The most 
notable contributor ， resolutionwise，was perhaps C. L. Dodgson，who spent the 
last years of his life working out theories of inference by which complex chains of 
reasoning could be analyzed by hand. He published Symbolic Logic, Part I， in 
1896, addressing it to children and to the young-in-heart by using his famous pen 
name Lewis Carroll. Section VILIL§3 of that book explains and illustrates how 
to eliminate variables by resolution，which he called the Method of Underscoring. 

When Dodgson died unexpectedly at the beginning of 1898, his nearly com¬ 
plete manuscript for Symbolic Logic, Part II， vanished until W. W. Bartley III 
was able to resurrect it in 1977. Part II was found to contain surprisingly novel 
ideas — especially its Method of Trees，which would have completely changed 
the history of mechanical theorem proving if it had come to light earlier. In this 
method, which Carroll documented at length in a remarkably clear and enter¬ 
taining way，he constructed search trees essentially like Fig. 39, then converted 
them into proofs by resolution. Instead of backtracking as in Algorithm D ， 
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which is a recursive depth-first method, he worked breadth-first: Starting at the 
root, he exploited unit clauses when possible，and branched on binary (or even 
ternary) clauses when necessary，successively filling out all unfinished branches 
level-by-level in hopes of being able to reuse computations. 

Logicians of the 20th century took a different tack. They basically dealt with 
the satisfiability problem in its equivalent dual form as the tautology problem, 
namely to decide when a Boolean formula is always true. But they dismissed 
tautology-checking as a triviality, because it could always be solved in a finite 
number of steps by just looking at the truth table. Logicians were far more 
interested in problems that were provably unsolvable in finite time, such as 
the halting problem — the question of whether or not an algorithm terminates. 
Nobody was bothered by the fact that an n-variable function has a truth table 
of length 2 n ，which exceeds the size of the universe even when n is rather small. 

Practical computations with disjunctive normal forms were pioneered by 
Archie Blake in 1937, who introduced the “consensus” of two implicants，which 
is dual to the resolvent of two clauses. Blake’s work was ， however，soon forgotten; 
E. W. Samson ， B. E. Mills，and (independently) W. V. O. Quine rediscovered 
the consensus operation in the 1950s，as discussed in exercise 7.1.1-31. 

The next important step was taken by E. W. Samson and R. K. Mueller 
[Report AFCRC-TR-55-118 (Cambridge, Mass.: Air Force Cambridge Research 
Center ， 1955)，16 pages]，who presented an algorithm for the tautology problem 
that uses consensus to eliminate variables one by one. Their algorithm therefore 
was equivalent to SAT solving by successively eliminating variables via resolu¬ 
tion. Samson and Mueller demonstrated their algorithm by applying it to the 
unsatisfiable clauses that we considered in ( 112 ) above. 

Independently, Martin Davis and Hilary Putnam had begun to work on the 
satisfiability problem, motivated by the search for algorithms to deduce formulas 
in first order logic — unlike Samson, Mills，and Mueller，who were chiefly inter¬ 
ested in synthesizing efficient circuits. Davis and Putnam wrote an unpublished 
62-page report “Feasible computational methods in the propositional calculus” 
(Rensselaer Polytechnic Institute，October 1958) in which a variety of different 
approaches were considered, such as the removal of unit clauses and pure literals, 
as well as “case analysis，” that is, backtracking with respect to the subproblems 
F\x and F\x. As an alternative to case analysis，they also discussed eliminating 
the variable x by resolution. The account of this work that was eventually pub¬ 
lished [JACM 7 (1960) ， 201-215] concentrated on hand calculation, and omitted 
case analysis in favor of resolution; but when the process was later implemented 
on a computer, jointly with George Logemann and Donald Loveland [CACM 5 
(1962) ， 394—397]，the method of backtracking through different cases was found 
to work better with respect to memory requirements. (See Davis’s account of 
these developments in Handbook of Automated Reasoning (2001) ， 3-15.) 

This early work didn’t actually cause the satisfiability problem to appear 
on many people’s mental radar screens ， however. Far from it; ten years went 
by before SAT became an important buzzword. The picture changed in 1971 ， 
when Stephen A. Cook showed that satisfiability is the key to solving NP- 
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complete problems: He proved that any algorithm to solve a decision problem in 
nondeterministic polynomial time can be represented efficiently as a conjunction 

of ternary clauses to be satisfied. (See STOC 3 (1971), 151—158. We’ll study NP- 
completeness in Section 7.9.) Thus, a great multitude of hugely important prob¬ 
lems could all be solved rather quickly，if we could only devise a decent algorithm 
for a single problem, 3SAT; and 3SAT seemed almost absurdly simple to solve. 

A year of heady optimism following the publication of Cook’s paper soon 
gave way to the realization that ， alas ， 3SAT might not be so easy after all. 
Ideas that looked promising in small cases didn’t scale well, as the problem 
size was increased. Hence the central focus of work on satisfiability largely 
retreated into theoretical realms, unrelated to programming practice, except 
for occasional studies that used SAT as a simple model for the behavior of 
backtracking algorithms in general. Examples of such investigations, pioneered 
by A. T. Goldberg ， P. W. Purdom ， Jr., C. A. Brown, J. V. Franco, and others ， 
appear in exercises 213-216. See P. W. Purdom, Jr., and G. N. Haven, SICOMP 
26 (1997) ， 456—483, for a survey of subsequent progress on questions of that kind. 

The state of SAT art in the early 90s was well represented by an international 
programming competition held in 1992 [see M. Buro and H. Kleine Biining ， 
Bulletin EATCS 49 (February 1993), 143-151]. The winning programs in that 
contest can be regarded as the first successful lookahead solvers on the path from 
Algorithm A to Algorithm L. Max Bohm “took the gold” by choosing the next 
branch variable based on lexicographically maximal (Hi (x )^...where 

H k (x) = + ⑻ + ⑻）， h k (x) = \{C e F \ x e C,\C\ = k}\. 

[See M. Bohm and E. Speckenmeyer, Ann. Math. Artif. Intelligence 17 (1996), 
381-400. A. Rauzy had independently proposed a somewhat similar branching 
criterion in 1988; see Revue d , intelligence artiBcielle 2 (1988), 41-60.] The silver 
medal went to Hermann Stamm, who used strong components of the dependency 
digraph to narrow the search at each branch node. 

Advances in practical algorithms for satisfiability now began to take off. 
The benchmark programs of 1992 had been chosen at random, but the DIMACS 
Implementation Challenge of 1993 featured also a large number of structured in¬ 
stances of SAT. The main purpose of this “challenge” was not to crown a winner ， 
but to bring more than 100 researchers together for a three-day workshop, at 
which they could compare and share results. In retrospect, the best overall 
performance at that time was arguably achieved by an elaborate lookahead 
solver called C-SAT，which introduced techniques for detailed exploration of the 
first-order effects of candidate literals [see O. Dubois ， P. Andre ， Y. Boufkhad ， 
and J. Carlier, DIMACS 26 (1996), 415-436]. Further refinements leading 
towards the ideas in Algorithm L appeared in a Ph.D. thesis by Jon W. Freeman 
(Univ. of Pennsylvania, 1995)，and in the work of Chu Min Li，who introduced 
double lookahead [see Information Processing Letters 71 (1999) ， 75-80]. The 
weighted binary heuristic ( 67 ) was proposed by O. Dubois and G. Dequen, Proc. 
International Joint Conference on Artificial Intelligence 17 (2001) ， 248-253. 
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Meanwhile the ideas underlying Algorithm C began to emerge. Joao P. 
Marques-Silva, in his 1995 thesis directed by Karem A. Sakallah，discovered how 
to turn unit-propagation conflicts into one or more clauses learned at “unique 
implication points，” after which it was often possible to backjump past decisions 

that didn’t affect the conflict. [See IEEE Trans. C48 (1999), 506-521.] Similar 
methods were developed independently by R. J. Bayardo, Jr.，and R. C. Schrag 
[AAAI Conf. 14 (1997) ， 203—208]，who considered only the special case of clauses 
that include the current decision literal, but introduced techniques for purging 
a learned clause when one of its literals was forced to flip its value. Both groups 
limited the size of learned clauses，and noticed that their new methods gave 
significant speedups on benchmark problems related to industrial applications. 

The existence of fast SAT solvers, coupled with Gunnar Stalmarck^ new 
ideas about applying logic to computer design [see Swedish patent 467076 (1992 )]， 
led to the introduction of bounded model checking techniques by Armin Biere ， 

Alessandro Cimatti, Edmund Clarke, and Yunshan Zhu [LNCS 1579 (1999), 
193-207]. Satisfiability techniques had also been introduced to solve classical 
planning problems in artificial intelligence [Henry Kautz and Bart Selman, Proc. 
European Conf. Artificial Intelligence 10 (1992), 359-363]. Designers could now 
verify much larger models than had been possible with BDD methods. 

The major breakthroughs appeared in a solver called Chaff [M. W. Moske- 
wicz, C. F. Madigan, Y. Zhao, L. Zhang, and S. Malik, ACM/IEEE Design 
Automation Conf. 38 (2001) ， 530—535]，which had two especially noteworthy in¬ 
novations: (i) “VSIDS” (the Variable State Independent Decreasing Sum heuris¬ 
tic), a surprisingly effective way to select decision literals, which also worked well 
with restarts, and which suggested the even better ACT heuristic of Algorithm C 
that soon replaced it; also (ii) lazy data structures with two watched literals 
per clause，which made unit propagation much faster with respect to large 
learned clauses. (A somewhat similar watching scheme, introduced earlier by 

H. Zhang and M. Stickel [J. Automated Reasoning 24 (2000), 277-296], had the 

disadvantage that it needed to be downdated while backtracking.) 

These exciting developments sparked a revival of international SAT compe¬ 
titions, which have been held annually since 2002. The winner in 2002, BerkMin 
by E. Goldberg and Y. Novikov, has been described well in Discrete Applied 
Mathematics 155 (2007) ， 1549-1561. And year after year，these challenging con¬ 
tests have continued to spawn further progress. By 2010， more than twice as 
many benchmarks could be solved in a given period of time as in 2002， using 
the programs of 2002 and 2010 on the computers of 2010 [see M. Jarvisalo, 
D. Le Berre, O. Roussel, and L. Simon, AI Magazine 33,1 (Spring 2012), 89-94]. 

The overall champion in 2007 was SATzilla，which was actually not a 
separate SAT solver but rather a program that knew how to choose intelligently 
between other solvers on any given instance. SATzilla would first take a few sec¬ 
onds to compute basic features of a problem: the distribution of literals per clause 
and clauses per literal，the balance between positive and negative occurrences of 
variables, the proximity to Horn clauses ， etc. Samples could quickly be taken to 
estimate how many unit propagations occur at levels 1 ， 4 ， 16, 64, 256, and how 
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many decisions are needed before reaching a conflict. Based on these numbers, 
and experience with the performance of the other solvers on the previous year’s 
benchmarks, SATzilla was trained to select the algorithm that appeared most 
likely to succeed. This “portfolio” approach, which tunes itself nicely to the 
characteristics of vastly different sets of clauses, has continued to dominate the 
international competitions ever since. Of course portfolio solvers rely on the 
existence of “real” solvers，invented independently and bug-free, which shine with 
respect to particular classes of problems. And of course the winner of competi¬ 
tions may not be the best actual system for practical use. [See L. Xu, F. Hutter ， 
H. H. Hoos，and K. Leyton-Brown, J. Artificial Intelligence Research 32 (2008 )， 
565-606; LNCS 7317 (2012), 228-241; CACM 57,5 (May 2014), 98-107.] 

Historical notes about details of the algorithms, and about important related 
techniques such as preprocessing and encoding, have already been discussed 
above as the algorithms and techniques were described. 

One recurring theme appears to be that the behavior of SAT solvers is full of 
surprises: Some of the most important improvements have been introduced for 
what has turned out to be the wrong reasons, and a theoretical understanding 
is still far from adequate. 

[In future, the next breakthrough might come from “variable learning，” 
as suggested by Tseytin’s idea of extended resolution: Just as clause learning 
increases the number of clauses ， m, we might find good ways to increase the 
number of variables, n. The subject seems to be far from fully explored.] 

EXERCISES 

1. [10] What are the shortest (a) satisfiable (b) unsatisfiable sets of clauses? 

2. [20] Travelers to the remote planet Pincus have reported that all the healthy 
natives like to dance, unless they’re lazy. The lazy nondancers are happy, and so are 
the healthy dancers- The happy nondancers are healthy; but natives who are lazy and 
healthy aren’t happy- Although the unhappy, unhealthy ones are always lazy，the lazy 
dancers are healthy. What can we conclude about Pincusians, based on these reports? 

3. [M21] Exactly how many clauses are in waerden (j, k] n)? 

4. [22] Show that the 32 constraints of waerden (3^ 3; 9) in (g) remain unsatisfiable 
even if up to four of them are removed- 

5. [M46] Is W(3, A:) = Q(k 2 )? 

► 6 . [HM37] Use the Local Lemma to show that W (3, k) = Q(k 2 /(log k) 3 ). 

7. [21] Can one satisfy the clauses { (xiWx i + 2 ,Vx i+ 2 , + i) |l<i< n-2 d+ \d >0}U 
{{xiW x i+2d \/ x i+2d+ i) \ l<i<n- 2 d+ \d > 0 }? 

► 8. [20] Define clauses waerden (ko, ki, … ,kb-i] n) that are satisfiable if and only if 
n <W (ko, ki, … ,kb-i)- 

9. [24] Determine the value of W(2, 2, A:) for all A: > 0 - Hint: Consider k mod 6 . 

► 10. [21] Show that every satisfiability problem with m clauses and n variables can be 
transformed into an equivalent monotonic problem with m + n clauses and 2 n variables, 
in which the first m clauses have only negative literals, and the last n clauses are binary 
with two positive literals. 
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11. [27] (M. Tsimelzon ， 1994.) Show that a general 3SAT problem with clauses 
{Ci^ •…， C m } and variables {1 ， •…， n} can be reduced to a 3D MATCHING problem 
of size 10 m that involves the following cleverly designed triples: 

Each clause Cj corresponds to 3 x 10 vertices, namely lj^ lj^ |Z|j’，and \l\j n for each 
l 6 Cj, together with wj, xj, yj, and zj, and also fk and j"k for 1 < A: < 7 - If i or ^ oc¬ 
curs in t clauses Cj” •一， Cj t , there are t “true” triples {ij k , ij’ k , ij’(} and t “false” triples 

mocH )}，£or 1 幺 k 幺 t. Each clause Cj = (/i V h V I 3 ) also spawns three 
“satisfiability” triples j ， j’l ， /’l} ， {/ 2 j,/l, /^}, { 『 3 j ， j’l ， /’3}; six “filler” triples 
{hjj%fl}, { 『 u ， /3 ， j"l} ， {Z 2 j ， /4 ， /2} ， { 『 2 j ， /5 ， j"2} ， {Z 3 j ， /6 ， j"3} ， { 『 3 j ， /7, j"3} ; 
and twelve “gadget” triples {wj ， j’2 ， /’4 }， {wj.fA.fA}, {xj ， /2 ， j"5 }， 

{ 巧，乃， /’5 }， {xjJ f 7J n 5}, {"j ， /3 ， /’6} ， {W ， j’4 ， j"6 }， {yjj7,j%}, {zj,j%j"7}, 
{zj ， j’5 ， /’7} ， {zj ， j’6 ， /’7}. Thus there are 27m triples altogether. 

For example, Rivest’s satisfiability problem ( 6 ) leads to a 3D matching prob¬ 
lem with 216 triples on 240 vertices; the triples that involve vertices 18 and 18 are 
{18,18', 18"}, {18,18 , ,ll /, }, {18,8 , 1,8 ,, 2}, {18, 8,4, 8"2}, {18, 8'5,8"2}. 

12. [21] (M. J. H. Heule.) Simplify ( 13 ) by exploiting the identity 

… ， yp) = , yj,t) A S<i(t, y j+ i,..., y P )). 

13. [24] Exercise 7.2.2.1-00 defines an exact cover problem that corresponds to Lang¬ 
ford pairs of order n. (See page vii.) 

a) What are the constraints analogous to ( 12 ) when n = 4? 

b) Show that there’s a simple way to avoid duplicate binary clauses such as those 
in ( 14 ), whenever an exact cover problem is converted to clauses using ( 13 ). 

c) Describe the corresponding clauses langford (4) and langford’ （ 4：). 

14. [22] Explain why the clauses ( 17 ) might help a SAT solver to color a graph. 

15. [24] By comparing the McGregor graph of order 10 in Fig. 33 
with the McGregor graph of order 3 shown here, give a precise 
definition of the vertices and edges of the McGregor graph that 
has an arbitrary order n > 3 - Exactly how many vertices and 
edges are present in this graph, as a function of n? 

16. [21] Do McGregor graphs have cliques of size 4? 

17. [26] Let f(n) and g(n) be the smallest and largest values of r such that Mc- 
Gregor’s graph of order n can be 4-colored, and such that some color appears exactly 
r times. Use a SAT solver to find as many values of f(n) and g{n) as you can. 

► 18. [28] By examining the colorings found in exercise 17, define an explicit way to 
4-color a McGregor graph of arbitrary order n, in such a way that one of the colors is 
used at most |n times. Hint: The construction depends on the value of n mod 6 . 

► 19. [29] Continuing exercise 17, let h{n) be the largest number of regions that can be 
given two colors simultaneously (without using the clauses ( 17 )) - Investigate h(n). 

20. [40] In exactly how many ways can McGregor’s map (Fig. 33) be four-colored? 

21. [22] Use a SAT solver to find a minimum-size kernel in the graph of Fig. 33- 

22. [20] Color the graph C 5 13 C 5 with the fewest colors. (Two vertices of this graph 
can receive the same color if and only if they are a king move apart in a 5 x 5 torus.) 

23. [20] Compare the clauses ( 18 ) and ( 19 ) to ( 20 ) and ( 21 ) in the case n = 7, r = 4. 
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► 24. [M32] The clauses obtained from ( 20 ) and ( 21 ) in the previous exercise can be 
simplified, because we can remove the two that contain the pure literal b\. 

a) Prove that the literal is always pure in ( 20 ) and ( 21 )，when r > n/2. 

b) Show that b\ might also be pure in some cases when r < n/2. 

c) The clauses obtained from ( 20 ) and ( 21 ) have many pure literals bj when r has its 
maximum value n — 1. Furthermore, their removal makes other literals pure. How 
many clauses will remain in this case after all pure literals have been eliminated? 

d) Show that the complete binary tree with n > 2 leaves is obtained from complete 
binary trees with n and n" = n — n leaves, where either n or n n is a power of 2 . 

e) Let a(n ， r) and c(n ， r) be respectively the number of auxiliary variables bj and 
the total number of clauses that remain after all of the pure auxiliary literals have 
been removed from ( 20 ) and ( 21 ). What are a(2 fc , 2 fc_1 ) and c(2 fc , 2 k ~ 1 )? 

f) Prove that a(n ， r) = a(n ， n") = a(n, n’）for n" < r < and this common value is 
maxi< r<n a(n, r). Also a(n ， r) = a(n，n — r); and c(n,r) > c(n, n — r) if r < n/ 2 . 

25. [21] Show that ( 18 )—( 19 ) and ( 20 )—( 21 ) are equally effective when r = 2. 

26. [22] Prove that Sinz’s clauses ( 18 ) and ( 19 ) enforce the cardinality constraint 
xi + • • • + x n < r. Hint: Show that they imply 5^=1 whenever + • • • +Xj+/c_i > k. 

27. [20] Similarly, prove the correctness of Bailleux and Boufkhad’s ( 20 ) and ( 21 ). 
Hint: They imply bj=l whenever the leaves below node k contain j or more Is- 

► 28• [20] What clauses result from ( 18 ) and ( 19 ) when we want to ensure that x\ + 
• • • + x n > 1? (This special case converts arbitrary clauses into 3SAT clauses-) 

► 29. [20] Instead of the single constraint x\ H - hx n < r, suppose we wish to impose 

a sequence of constraints xi + • • • + 而 < ri for 1 < i < n- Can this be done nicely with 
additional clauses and auxiliary variables? 

► 30. [22] If auxiliary variables s'- are used as in ( 18 ) and ( 19 ) to make - Vx n < r, 

while s’j k are used to make xi + • • • + < n — r, show that we may unify them by 

taking = 5 ^ for 1 < j < n — r, 1 < A: < r- Can ( 20 ) and ( 21 ) be similarly unified? 

► 31. [28] Let Ft{r) be the smallest n for which there is a bit vector xi .. .x n with 

xi + • • • + = r and with no t equally spaced Is. For example, ^(12) = 30 because 

of the unique solution 101100011010000000010110001101. Discuss how Ft(n) might be 

computed efficiently with the help of a SAT solver• 

32. [15] A list coloring is a graph coloring in which v’s color belongs to a given 
set L(v)^ for each vertex v. Represent list coloring as a SAT problem. 

33. [21] A double coloring of a graph is an assignment of two distinct colors to every 
vertex in such a way that neighboring vertices share no common colors- Similarly, a g - 
tuple coloring assigns q distinct colors to each vertex. Find double and triple colorings 
of the cycle graphs C 5 , CV, Cq , …， using as few colors as possible. 

34. [HM26] The fractional coloring number x*((7) of a graph G is defined to be the 
minimum ratio p/q for which G has a ^-tuple coloring that uses p colors. 

a) Prove that S X(G)，and show that equality holds in McGregor’s graphs. 

b) Let Si ， … ， Sn be all the independent subsets of G^s vertices. Show that 

= min {Ai + ••• + Aat | X^f =1 ^j[v G ] = 1 for all vertices v}. 

A 1 ，…， A jy 2 0 

(This is a fractional exact cover problem.) 

c) What is the fractional coloring number x*(Cn) of the cycle graph (7 n ? 
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d) Consider the following greedy algorithm for coloring G: Set k <— 0 and Go G; 
while Gk is nonempty, set k <— k + 1 and Gk Gk-i \Ck, where Ck is a maximum 
independent set of Gk—i* Prove that h < (G)，where a{G) is the size of G’s 

largest independent set; hence x{G)/x^{G) < H a ( G ) = 0(log n). Hint: Let t v = 
1/|(7^| if i; G Ci, and show that J2 v es — ^\s\ whenever S is an independent set- 

35. [22] Determine when G is (a) the graph of the contiguous United States 

(see 7 -( 17 ) and exercise 7-45); (b) the graph of exercise 22. 

► 36. [22] A radio coloring of a graph, also known as an 1/(2,1) labeling, is an assign¬ 
ment of integer colors to vertices so that the colors of u and v differ by at least 2 
when u —— v, and by at least 1 when u and v have a common neighbor. (This notion ， 
introduced by Fred Roberts in 1988, was motivated by the problem of assigning channels 
to radio transmitters, without interference from “dose” transmitters and without strong 
interference from “very dose” transmitters.) Find a radio coloring of Fig. 33 that uses 
only 16 consecutive colors. 

37. [20] Find an optimum radio coloring of the contiguous USA graph (see 7 -( 17 )). 

38• [M25] How many consecutive colors are needed for a radio coloring of (a) the 
n x n square grid P n uP n l (b) the vertices {(x^y^z) \ x^y^z > 0, x + y+ z = n } 5 which 
form a triangular grid with n + 1 vertices on each side. 

39• [M 46 ] Find an optimum radio coloring of the n-cube，for some value of n > 6 . 

40• [01] Is the factorization problem ( 22 ) unsatisfiable whenever 2 ： is a prime number? 

41. [M21] Determine the number of Boolean operations 八， V, ㊉ needed to multiply 
m-bit numbers by n-bit numbers with Dadda’s scheme, when 2 < m < n. 

42. [21] Tseytin encoding analogous to ( 24 ) can be devised also for ternary opera- 
tions, without introducing any additional variables besides those of the function being 
encoded. Illustrate this principle by encoding the basic operations a : 卜 t ㊉ 乜 ㊉ r and 
y (tuv) of a full adder directly，instead of composing them from ㊉ ，八 , and V. 

► 43. [21] For which integers n > 2 do there exist odd palindromic binary numbers 
x = (x n • ^ Xi )2 = (xi ... x n )2 and y = (y n ... y x ) 2 = (?/i • • • ynh such that their 
product xy = (z m ^ n - - - ^ 1)2 = (zi ... Zm^-n )2 is also palindromic? 

► 44. [30] (Maximum ones.) Find the largest possible value of vx + vy + v{xy)^ namely 
the greatest total number of 1 bits, over all multiplications of 32-bit binary x and y. 

45. [20] Specify clauses that constrain ( 2 ^ • • • 2 : 1)2 to be a perfect square. 

46. [30] Find the largest perfect square less than 2 100 that is a binary palindrome- 

► 47. [20] Suppose a circuit such as Fig. 34 has m outputs and n inputs，with g gates 
that transform two signals into one and h gates that transform one signal into two. 
Find a relation between g and by expressing the total number of wires in two ways. 

48. [20] The small circuit shown here has three inputs, three XOR gates, 
one fanout gate, eight wires, and one output- Which single - stuck-at faults are 
detected by each of the eight test patterns pqrl 

49. [24] Write a program that determines exactly which of the 100 single- 
stuck - at faults of the circuit in Fig. 34 are detected by each of the 32 possible 
input patterns. Also find all the minimum sets of test patterns that will 
discover every such fault (unless it’s not detectable)- 
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50. [24] Demonstrate Larrabee’s method of representing stuck-at faults by describing 
the clauses that characterize test patterns for the fault u xl stuck at 1” in Fig. 34. (This 
is the wire that splits off of X 2 and feeds into xl and then to 62 and 63 ； see Table 1.) 

51. [40] Study the behavior of SAT solvers on the problem of finding a small number of 
test patterns for all of the detectable single-stuck-at faults of the circuit prod (32, 32). 
Can a complete set of patterns for this large circuit be discovered “automatically” 
(without relying on number theory)? 

52. [15] What clauses correspond to ( 29 ) and ( 30 ) when the second case on the left 
of Table 2, /(l, 0,1, 0, … ， 1) = 1, is taken into account? 

► 53. [M20] The numbers in Table 2 are definitely nonrandom. Can you see why? 

► 54. [23] Extend Table 2 using the rule in the previous exercise- How many rows are 
needed before f(x) has no M-term representation in DNF, when M = 3, 4, and 5? 

55. [21] Find an equation analogous to ( 27 ) that is consistent with Table 2 and has 
every variable complemented. (Thus the resulting function is monotone decreasing.) 

► 56. [22] Equation ( 27 ) exhibits a function matching Table 2 that depends on only 8 
of the 20 variables. Use a SAT solver to show that we can actually find a suitable / 
that depends on only five of the Xj. 

► 57. [29] Combining the previous exercise with the methods of Section 7.1-2, exhibit 
a function f for Table 2 that can be evaluated with only six Boolean operations(!). 

► 58. [20] Discuss adding the clauses pij V qij to ( 29 ) ，（ 30 )，and ( 31 )- 

A 

59. [M20] Compute the exact probability that f(x) in ( 32 ) differs from f(x) in ( 27 ). 

60. [24] Experiment with the problem of learning f(x) in ( 27 ) from training sets of 

八 

sizes 32 and 64. Use a SAT solver to find a conjectured function, f(x); then use BDD 

A 

methods to determine the probability that this f(x) differs from f(x) for random x. 

61. [20] Explain how to test when a set of clauses generated from a training set via 
( 29 )—( 31 ) is satisfiable only by the function f(x) in ( 27 ). 

62. [23] Try to learn a secret small-DNF function with iV-bit training sets x ⑼， x^\ 
x( 2 \ where is random but each bit of ㊉ x*— 1 ) for A: > 0 is 1 with 
probability p. (Thus, if p is small, successive data points will tend to be near each 
other.) Do such sets turn out to be more efficient in practice than the purely random 
ones that arise for p = 1 / 2 ? 

► 63. [20] Given an n-network a = [ii : ji][i 2 : j‘ 2 ] • • • - jr], as defined in the exercises 

for Section 5.3.4, explain how to use a SAT solver to test whether or not a is a sorting 
network. Hint: Use Theorem 5.3.4Z. 

A 

64. [26] The exact minimum time T(n) of a sorting network for n elements is a famous 

八 

unsolved problem, and the fact that T(9) = 7 was first established in 1987 by running a 
highly optimized program for many hours on a Cray 2 supercomputer* 

Show that this result can now be proved with a SAT solver in less than a second(!)• 

► 65. [28] Describe encodings of the Life transition function ( 35 ) into clauses. 

a) Use only the variables x\j and 

b) Use auxiliary variables as in the Bailleux and Boufkhad encoding ( 20 )—( 21 ), shar¬ 
ing intermediate results between neighboring cells as discussed in the text. 

66 . [24] Use a SAT solver to find short counterparts to Fig. 35 in which (a) Xi = LIFE; 
(b) X 2 = LIFE. In each case Xo should have the smallest possible number of live cells. 
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67. [24] Find a mobile chessboard path Xo ^ Xi X 21 with no more than five 

cells alive in each Xt. (The glider in ( 37 ) leaves the board after X 20 O How about X 22 ? 

68 . [39] Find a maximum-length mobile path in which 6 to 10 cells are always alive- 

69. [23] Find all (a) still lifes and (b) oscillators of period > 1 that live in a 4x4 board. 

70. [21] The live cells of an oscillator are divided into a rotor (those that change) and 
a stator (those that stay alive) - 

a) Show that the rotor cannot be just a single cell. 

b) Find the smallest example of an oscillator whose rotor is m e 

c) Similarly, find the smallest oscillators of period 3 whose rotors have the following 
forms: Fh —>• Rh Bn RnI Pb ^ Hi ~^ In ^ li P% ^ Sh li • 

► 71. [22] When looking for sequences of Life transition on a square grid，an asymmet¬ 
rical solution will appear in eight different forms, because the grid has eight different 
symmetries. Furthermore, an asymmetrical periodic solution will appear in 8 r different 
forms, if r is the length of the period. 

Explain how to add further clauses so that essentially equivalent solutions will 
occur only once: Only “canonical forms” will satisfy the conditions. 

72. [28] Oscillators of period 3 are particularly intriguing, because Life seems so 
inherently binary. 

a) What are the smallest such oscillators (in terms of bounding box)? 

b) Find period-3 oscillators of sizes 9 x n and 10 x n, with n odd, that have “fourfold 
symmetry” ： The patterns are unchanged after left-right and/or up-down reflection. 
(Such patterns are not only pleasant to look at, they also are much easier to find, 
because we need only consider about one-fourth as many variables.) 

c) What period-3 oscillators with fourfold symmetry have the most possible live cells ， 
on grids of sizes 15 x 15, 15 x 16, and 16 x 16? 

d) The period-3 oscillator shown here has another kind of four-way sym- 
metry，because it’s unchanged after 90° rotation. (It was discovered in 
1972 by Robert Wainwright, who called it “snake dance” because its 
stator involves four snakes.) What period-3 oscillators with 90° symmetry have 
the most possible live cells, on grids of sizes 15 x 15 and 16 x 16? 

► 73. [21] (Mobile flip flops.) An oscillator of period 2 is called a flip flop ， and the Life 
patterns of mobile flipflops are particularly appealing: Each cell is either blank (dead 
at every time t) or type A (alive when t is even) or type B (alive when t is odd). Every 
nonblank cell (i) has exactly three neighbors of the other type，and (ii) doesn’t have 
exactly two or three neighbors of the same type. 

a) The blank cells of a mobile flipflop also satisfy a special condition. What is it? 

b) Find a mobile flipflop on an 8 x 8 grid, with top row H ~k 齙 bH~ . 

c) Find patterns that are mobile flipflops on m x n toruses for various m and n. 
(Thus, if replicated indefinitely，each one will tile the plane with an infinite mobile 
flipflop.) Hint: One solution has no blank cells whatsoever; another has blank 
cells like a checkerboard. 

74. [M28] Continuing the previous exercise, prove that no nonblank cell of a finite 
mobile flipflop has more than one neighbor of its own type. (This fact greatly speeds 
up the search for finite mobile flipflops.) Can two type A cells be diagonally adjacent? 

75. [M22] (Stephen Silver ， 2000.) Show that a finite，mobile oscillator of period p >3 
must have some cell that is alive more than once during the cycle. 
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76. [41] Construct a mobile Life oscillator of period 3- 

77. [20] “Step X-i ，” which precedes Xo in ( 38 )，has the glider configuration ® instead 
of HB- What conditions on the still life X 5 will ensure that state Xo is indeed reached? 
(We don’t want digestion to begin prematurely.) 

78. [21] Find a solution to the four-step eater problem in ( 38 ) that works on a 7 x n 
grid, for some n, instead of 8 x 8 . 

79. [23] What happens if the glider meets the eater of ( 39 ) in its opposite phase 
(namely 9 instead of BJ)? 

80. [21] To counteract the problem in the previous exercise, find an eater that is 
symmetrical when reflected about a diagonal，so that it eats both 3 and Bl- (You’ll 
have to go larger than 8 x 8 , and you’ll have to wait longer for digestion.) 

81. [21] Conway discovered a remarkable “spaceship，” where X 4 is Xo shifted up 2: 


Xo = 


圍圍 j 圍 4圍=〜 

rm U I m I I n n 1 I ININ INN 


Is there a left-right symmetrical still life that will eat such spaceships? 

► 82. [22] (Light speed.) Imagine Life on an infinite plane, with all cells dead at time 0 
except in the lower left quadrant- More precisely, suppose Xt = (xtij) is defined for all 
t > 0 and all integers —00 < i，j < + 00 , and that xoij = 0 whenever i > 0 or j > 0- 

a) Prove that xtij = 0 whenever 0 < t < max(i, j). 

b) Furthermore xtij = 0 when 0 < —i < j and 0 < t < i + 2j. 

c) And xtij = 0 for 0 < t < 2i + if i > 0 and j > 0. Hint: If xtij = 0 whenever 
i > —j^ prove that xtij = 0 whenever i > —j. 


83. [21] According to the previous exercise, the earliest possible time that cell (i,j) 
can become alive, if all initial life is confined to the lower left quadrant of the plane, is 


at least 

For example，when |i| < 5 and \j\ <5 the values of 
are shown at the right. 

Let fUj) be the actual minimum time at which cell 
(i ， j) can be alive，for some such initial state. Devise a set 
of clauses by which a SAT solver can test whether or not 
/*(io, jo) = f(i(h jo), given io and jo. (Such clauses make 
interesting benchmark tests.) 


5 6 7 8 9 101214 1618 20 
4 4 5 6 7 8 1012141618 
3 3 3 4 5 6 8 10121416 
2 2 2 2 3 4 6 8 10 1214 
111112468 10 12 
0000002468 10 
00000013579 
00000012468 
00000012357 
00000012346 
00000012345 


84. [33] Prove that fUj) = /(i ， j) in the following cases when j > 0: (a) i = j, 
< =j + 1 ， and i = j — 1. (b) i = 0 and i = — 1 . (c) i = 1 — j. (d) i = j — 2. (e) i = —2. 


► 85. [39] A Garden of Eden is a state of Life that has no predecessor. 

a) If the pattern of 92 cells illustrated here occurs anywhere within a 
bitmap X, verify that X is a Garden of Eden. (The gray cells can be 
either dead or alive.) 

b) This “orphan” pattern, found with a SAT solver’s help, is the smallest 
that is currently known. Can you imagine how it was discovered? 

86 . [M23] How many Life predecessors does a random 10x10 bitmap have，on average? 

87. [21] Explain why the clauses ( 42 ) represent Alice and Bob’s programs ( 40 )，and 
give a general recipe for converting such programs into equivalent sets of clauses. 
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88 . [18] Satisfy ( 41 ) and ( 42 ) for 0 < t < 6 , and the 20 x 6 additional binary clauses 
that exclude multiple states, along with the “embarrassing” unit clauses (A36) 八 （ B36). 

89. [21] Here’s a mutual-exclusion protocol once recommended in 1966. Does it work? 

AO. Maybe go to Al. BO. Maybe go to Bl. 

Al. Set a f 1, go to A2. Bl. Set 6 t 1, go to B2, 

A2* If l go to A3, else to A5. B2* If l go to B5, else to B3- 

A3. If b go to A3, else to A4. B3. If a go to B3, else to B4. 

A4. Set Z 4— 0, go to A2. B4* Set Z 4- 1, go to B2. 

A5 - Critical, go to A6. B5 - Critical, go to B6. 

A6. Set a 0, go to A0. B6. Set 6 0, go to B0. 

90. [20] Show that ( 43 ) ，（ 45 )， and ( 46 ) permit starvation，by satisfying ( 47 ) and ( 48 ). 

91. [M21] Formally speaking, Alice is said to “starve” if there is (i) an infinite se¬ 
quence of transitions Xo Xi ^ starting from the initial state 叉 0 , and (ii) an in¬ 
finite sequence ◎() ， @ 1 ， • • • of Boolean “bumps” that changes infinitely often, such that 
(iii) Alice is in a “maybe” or “critical” state only a finite number of times. Prove that 
this can happen if and only if there is a starvation cycle ( 47 ) as discussed in the text- 

92. [20] Suggest 0(r 2 ) clauses with which we can determine whether or not a mutual 

exclusion protocol permits a path Xo Xi of distinct states. 

93. [20] What clauses correspond to the termin ( 51 )? 

► 94. [21] Suppose we know that (Xo -> Xi ^ ) A is unsatisfiable for 

0 < r < A:. What clauses will guarantee that $ is invariant? (The case k = 1 is ( 51 ).) 

95. [20] Using invariants like ( 50 ), prove that ( 45 ) and ( 46 ) provide mutual exclusion. 

96. [22] Find all solutions to ( 52 ) when r = 2. Also illustrate the fact that invariants 
are extremely helpful, by finding a solution with distinct states 為， X\^ … ， X r and 
with r substantially greater than 2 , if the clauses involving $ are removed- 

97. [20] Can states A 6 and B 6 occur simultaneously in Peterson’s protocol ( 49 )? 

► 98. [M23] This exercise is about proving the nonexistence of starvation cycles ( 47 )- 

a) A cycle of states is called “pure” if one of the players is never bumped, and “simple” 
if no state is repeated. Prove that the shortest impure cycle, if any, is either simple 
or consists of two simple pure cycles that share a common state. 

b) If Alice is starved by some cycle with protocol ( 49 )，we know that she is never in 
states A0 or A5 within the cycle. Show that she can’t be in Al, A 2 , or A 6 either. 

c) Construct clauses to test whether there exist states Xo X\ —)►•••—)► , with 

Xo arbitrary, such that (X 0 X 1 ... X^-i) is a starvation cycle for some k < r. 

d) Therefore we can conclude that ( 49 ) is starvation-free without much extra work. 

99. [25] Th. Dekker devised the first correct mutual-exclusion protocol in 1965: 


A0. Maybe go to Al. 

Al. Set a [ 1, go to A2* 

A2. If b go to A3, else to A6. 
A3. If l go to A4, else to A2. 
A4. Set a 0, go to A5. 

A5 - If l go to A5, else to Al. 
A6. Critical, go to A7. 

A7. Set Z 1, go to A8. 

A8. Set a 0, go to A0. 


B0. Maybe go to Bl. 

Bl. Set 6 4- 1, go to B2. 

B2* If a go to B3, else to B6. 
B3- If l go to B2, else to B4. 
B4. Set 6 0, go to B5. 

B5 - If l go to Bl, else to B5- 
B6. Critical, go to B7- 
B7- Set Z 0, go to B8. 

B8. Set 6 0, go to B0. 


Use bounded model checking to verify its correctness- 
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100. [22] Show that the following protocol can starve one player but not the other: 

BO. Maybe go to Bl. 

Bl. Set b i— 1, go to B2. 

B2. If a go to B3, else to B5. 

B3. Set 6 - 0, go to B4. 

B4. If a go to B4, else to Bl. 

B5. Critical, go to B6. 

B6. Set 6 0, go to BO. 

► 101. [31] Protocol ( 49 ) has the potential defect that Alice and Bob might both be 
trying to set the value of l at the same time. Design a mutual-exclusion protocol in 
which each of them controls two binary signals, visible to the other. Hint: The method 
of the previous exercise can be enclosed in another protocol. 

102 . [22] If Alice is setting a variable at the same time that Bob is trying to read 
it, we might want to consider a more stringent model under which he sees either 0 
or 1 , non deterministically- (And if he looks k times before she moves to the next 
step, he might see 2 k possible sequences of bits.) Explain how to handle this model of 
“flickering” variables by modifying the clauses of exercise 87. 

103. [18] (Do this exercise by hand, it’s fun!) Find the 7x21 image whose tomographic 
sums are (n，• • • ， ” 7 ) = (1 ， 0,13, 6 ,12, 7, 19); (ci ， •… ， c‘ 2 i) = (4, 3, 3,4,1 ， 6 ,1 ， 3, 3, 3, 5,1 ， 
1, 5,1 ， 5,1 ， 5,1 ， 1 ， 1); (ai ， ... ， a27) = (0, 0,1 ， 2, 2,3, 2, 3,3, 2,3,3,4,3,2, 3,3, 3,4, 3, 2,2,1 ， 
1 ， 1 ， 1 ， 1); ( 〜， •… ， 6 27 ) = (0, 0, 0,0, 0,1 ， 3, 3,4, 3, 2, 2, 2, 3, 3, 4, 2, 3, 3, 3, 3, 3,4, 3, 2,1 ， 1). 

104. [M21] For which m and n is it possible to satisfy the digital tomography problem 
with ad = bd = 1 for 0 < d < m + n? (Equivalently, when can m + n — 1 nonattacking 
bishops be placed on an m x n board?) 

► 105. [M28] A matrix whose entries are { —1 ， 0 ， +1} is tomographically balanced if its 
row, column, and diagonal sums are all zero- Two binary images X = (Xij) and 
X r = (x[j) clearly have the same row, column, and diagonal sums if and only if X — X f 
is tomographically balanced. 

a) Suppose Y is tomographically balanced and has m rows, n columns，and t occur¬ 
rences of +1. How many m x n binary matrices X and X r satisfy X — X r = Y? 

b) Express the condition U Y is tomographically balanced” in terms of clauses, with 
the values { — 1 , 0 , + 1 } represented respectively by the 2 -bit codes { 10 , 00 , 01 }. 

c) Count the number T(m,n) of tomographically balanced matrices，for m^n < 8. 

d) How many such matrices have exactly four occurrences of +1? 

e) At most how many +ls can a 2n x 2n tomographically balanced matrix have? 

f) True or false: The positions of the +ls determine the positions of the —Is. 

106. [M20] Determine a generous upper bound on the possible number of different 
sets of input data {ri^Cj^ad^bd} that might be given to a 25 x 30 digital tomography 
problem, by assuming that each of those sums independently has any of its possible 
values* How does this bound compare to 2 750 ? 

► 107. [22] Basket weavers from the Tonga culture of Inhambane, Mozambique, have 
developed appealing periodic designs called “gipatsi patterns” such as this: 

(Notice that an ordinary pixel grid has been rotated by 45°.) Formally speaking, a 
gipatsi pattern of period p and width n is a p x n binary matrix (xij) in which we have 


A0. Maybe go to Al. 

Al. Set a 1, go to A2, 

A2. If b go to A2, else to A3. 
A3- Critical, go to A4. 

A4. Set a 0, go to A0. 
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Xi^i = Xi^n =1 for 1 < i < p. Row i of the matrix is to be shifted right by i — 1 places 
in the actual pattern. The example above has p = 6 ， n = 13， and the first row of its 
matrix is 1111101111101- Such a pattern has row sums Vi — X ^ =1 Xij for 1 < i < p and 
column sums Cj = x i，j for 1 < j < n, as usual By analogy with ( 53 )，it also has 

ad = [ Xij, I <d<p; b d = [ Xij, l<d<2p. 

i+j 三 d (modulop) 2i-{-j=d (modulo 2p) 


a) What are the tomographic parameters ri, Cj ， ad, and bd in the example pattern? 

b) Do any other gipatsi patterns have the same parameters? 

108. [23] The column sums Cj in the previous exercise are somewhat artificial，because 
they count black pixels in only a small part of an infinite line. If we rotate the grid at 
a different angle, however, we can obtain infinite periodic patterns for which each of 
Fig. 36’s four directions encounters only a finite number of pixels. 

Design a pattern of period 6 in which parallel lines always have equal tomographic 
projections, by changing each of the gray pixels in the following diagram to either white 


or black: 



► 109. [20] Explain how to find the lexicographically smallest solution x\ .. .x n to a 
satisfiability problem, using a SAT solver repeatedly- (See Fig. 37(a)-) 

110. [19] What are the lexicographically (first ， last) solutions to waerden(3^ 10; 96)? 

111. [40] The lexicographically first and last solutions to the “Cheshire Tom” problem 
in Fig. 37 are based on the top-to-bottom - and-left-to-right ordering of pixels. Experi¬ 
ment with other pixel orderings — for example, try bottom-to-top-and-right-to-left. 

112. [ 46 ] Exactly how many solutions does the tomography problem of Fig. 36 have? 

► 113. [30] Prove that the digital tomography problem is NP-complete, even if the 

marginal sums r ， c, a, 6 are binary: Show that an efficient algorithm to decide whether 
or not an n x n pixel image (xij) exists，having given 0—1 values of Xij^ 

Cj = Y^i x ih a d = Yji+j=d+i and bd = Y^i-j=d-n could be used to solve the 
binary tensor contingency problem of exercise 212 (a). 

114. [27] Each cell (i ， j) of a given rectangular grid either contains a land mine 
(xij = 1) or is safe (xij = 0) - In the game of Minesweeper, you are supposed to identify 
all of the hidden mines, by probing locations that you hope are safe: If you decide to 
probe a cell with Xij = 1, the mine explodes and you die (at least virtually). But if 
Xij = 0 you Ye told the number mj of neighboring cells that contain mines, 0 < riij < 8 , 
and you live to make another probe. By carefully considering these numeric clues, you 
can often continue with completely safe probes，eventually touching every mine-free cell. 

For example, suppose the hidden mines happen to match the 25 x 30 pattern of 
the Cheshire cat (Fig. 36)，and you start by probing the upper right corner. That cell 
turns out to be safe, and you learn that m ， 3 o = 0 ; hence it’s safe to probe all three 
neighbors of (1 ， 30). Continuing in this vein soon leads to illustration (a) below, which 
depicts information about cells (i,j) for 1 < i < 9 and 21 < j < 30; unprobed cells are 
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shown in gray，otherwise the value of riij appears- From this data it’s easy to deduce 
that $i,24 = x‘2,24 = X3,25 = $4,25 = • • • = ^9,26 = 1 ； you^ll never want to probe in those 
places, so you can mark such cells with X， arriving at state (/3) since 713,24 = ns, 25 = 4. 
Further progress downward to row 17, then leftward and up, leads without difficulty to 
state ( 7 ). (Notice that this process is analogous to digital tomography, because you’re 
trying to reconstruct a binary array from information about partial sums.) 


■■■■200000 
■■■■310000 
■■■■■20000 
■■■■■31000 
=■■■■■2000 
■■■■■■3000 
■■■■■■3000 
■■■■■■3000 
■■■■■■3100 


(« = 


■■X200000 
■_X310000 
■■4X20000 
■■■X31000 
■■■4X2000 
■■■■X3000 
■■■■X3000 
■■■■X3000 
■■■■X3100 


01LJX200000 
12DX310000 
2X 4X20000 
{ \ 5XHLIX31000 

( 7 ) = XX1B4X2000 
5XHBJX3000 
3X JBX3000 
2X424X3000 
12X23X3100 


a) Now find safe probes for all thirteen of the cells that remain gray in ( 7 ). 

b) Exactly how much of the Cheshire cat can be revealed without making any unsafe 
guesses, if you^re told in advance that (i) xi，i =0? (ii) xi^o = 0? (iii) ^ 25,1 = 0? 
(iv) a ： 25,30 = 0? (v) all four corners are safe? Hint: A SAT solver can help- 

115. [25] Empirically estimate the probability that a 9 x 9 game of Minesweeper, with 

10 randomly placed mines, can be won with entirely safe probes after the first guess. 

116. [22] Find examples of Life flipflops for which X and X f are tomographically 

equal. 

117. [23] Given a sequence x = xi ... x n ， let v^x = X 1 X 2 + X 2 X^ + • • • + x n -ix n . 

(A similar sum appears in the serial correlation coefficient, 3.3.2—( 23 ).) 

a) Show that, when x is a binary sequence, the number of runs of Is in x can be 
expressed in terms of vx and x. 

b) Explain how to encode the condition < r as a set of clauses, by modifying 

the cardinality constraints ( 20 )—( 21 ) of Bailleux and Boufkhad. 

c) Similarly, encode the condition x > r. 

118. [20] A tatami tiling is a covering by dominoes in which no three share a corner: 



(Notice that is disallowed，but would be fine.) Explain how to use a SAT solver 
to find a tatami tiling that covers a given set of pixels, unless no such tiling exists. 


119. [18] Let F = waerden(3^ 3; 9) be the 32 clauses in (g). For which literal l is the 
reduced formula F \ l smallest? Exhibit the resulting clauses* 

120. [M20] True or false: F\L = {C\L\CeF^ndCDL = d}},ifL = {l\leL}. 

121 . [21] Spell out the changes to the link fields in the data structures, by expanding 
the higher-level descriptions that appear in steps A3, A4, A7, and A 8 of Algorithm A. 

► 122. [21] Modify Algorithm A so that it finds all satisfying assignments of the clauses. 

123. [17] Show the contents of the internal data structures L, START, and LINK when 
Algorithm B or Algorithm D begins to process the seven clauses R r of ( 7 ). 
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► 124. [21] Spell out the low-level link field operations that are sketched in step B3. 

► 125. [20] Modify Algorithm B so that it finds all satisfying assignments of the clauses. 
126. [20] Extend the computation in ( 59 ) by one more step- 

127* [17] What move codes mi … md correspond to the computation sketched in ( 59 ), 
just before and after backtracking occurs? 

128. [19] Describe the entire computation by which Algorithm D proves that Rivest’s 
clauses ( 6 ) are unsatisfiable, using a format like ( 59 ). (See Fig. 39.) 

129. [20] In the context of Algorithm D, design a subroutine that, given a literal 
returns 1 or 0 according as l is or is not being watched in some clause whose other 
literals are entirely false- 

130. [22] What low-level list processing operations are needed to “dear the watch list 
for x/c” in step D 6 ? 

► 131. [30] After Algorithm D exits step D3 without finding any unit clauses, it has 
examined the watch lists of every free variable. Therefore it could have computed the 
lengths of those watch lists，with little additional cost; and information about those 
lengths could be used to make a more informed decision about the variable that’s chosen 
for branching in step D4. Experiment with different branching heuristics of this kind. 

► 132. [22] Theorem 7-1-1K tells us that every 2SAT problem can be solved in linear 
time. Is there a sequence of 2SAT clauses for which Algorithm D takes exponential time? 

► 133. [25] The size of a backtrack tree such as Fig. 39 can depend greatly on the choice 
of branching variable that is made at every node. 

a) Find a backtrack tree for waerden(3^ 3; 9) that has the fewest possible nodes. 

b) What’s the largest backtrack tree for that problem? 

134. [22] The BIMP tables used by Algorithm L are sequential lists of dynamically 
varying size- One attractive way to implement them is to begin with every list having 
capacity 4 (say); then when a list needs to become larger, its capacity can be doubled. 

Adapt the buddy system (Algorithm 2.5R) to this situation. (Lists that shrink 
when backtracking needn’t free their memory，since they’re likely to grow again later.) 

► 135. [16] The literals l f in BIMP (Z) are those for which l — y l f in the “implication 
digraph” of a given satisfiability problem. How can we easily find all of the literals l n 
such that l n ― >1 ， given l? 

136. [15] What pairs will be in TIMP (3) ， before and after X 5 is set to zero with respect 
to the clauses (g) of waerden(3^ 3; 9)，assuming that we are on decision level d = 0? 

1S7. [24] Spell out in detail the processes of (a) removing a variable X from the free 
list and from all pairs in TIMP lists (step L7 of Algorithm L)，and of (b) restoring it 
again later (step L12). Exactly how do the data structures change? 

► 138. [20] Discuss what happens in step L9 of Algorithm L if we happen to have both 
v G BIMP ( 丑 ） and u G BIMP(v). 

139. [25] (Compensation resolvents.) If w ^ BIMP(v)，the binary clause u\/ v implies 
the binary clause u\J w, because we can resolve uM v with v\/ w. Thus step L9 could 
exploit each new binary clause further，by appending w as well as v to BIMP ( 丑 ）， for all 
such w. Discuss how to do this efficiently. 
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140. [21] The FORCE ， BRANCH, BACKF，and BACKI arrays in Algorithm L will obviously 
never contain more than n items each. Is there a fairly small upper bound on the 
maximum possible size of ISTACK? 

141. [18] Algorithm L might increase ISTAMP so often that it overflows the size of the 
IST(D fields. How can the mechanism of ( 63 ) avoid bugs in such a case? 

142. [24] Algorithms A, B，and D can display their current progress by exhibiting 
a sequence of move codes mi - . - such as ( 58 ) and ( 60 ); but Algorithm L has no 
such codes. Show that an analogous sequence rrn • • • tuf could be printed in step L2, 
if desired. Use the codes of Algorithm D; but extend them to show rrij = 6 (or 7) if 
Rj-i is a true (or false) literal whose value was found to be forced by Algorithm X, or 
forced by being a unit clause in the input. 

► 143• [30] Modify Algorithm L so that it will apply to nonempty clauses of any size. 
Call a clause big if its size is greater than 2. Instead of TIMP tables, represent every big 
clause by ‘KINX’ and ‘CINX’ tables: Every literal l has a sequential list KINX(/) of big 
clause numbers; every big clause c has a sequential list CINX (c) of literals; c is in KINX(0 
if and only if l is in CINX (c). The current number of active clauses containing l is indi¬ 
cated by KSIZE(D; the current number of active literals in c is indicated by CSIZE(c) - 

144. [15] True or false: If l doesn’t appear in any clause, h’ （ l) = 0-1 in ( 65 )- 

145. [23] Starting with h{l) = 1 for each of the 18 literals l in waerden(3^ 3; 9), find 
successively refined estimates h’ （ l) ， h" (J), • • • ， using ( 65 ) with respect to the 32 ternary 
clauses (g). Then, assuming that X 5 has been set false as in exercise 136, and that the 
resulting binary clauses 13, 19 ， 28, 34, 37 ， 46, 67， 79 have been included in the BIMP 
tables, do the same for the 16 literals that remain at depth d = 1. 

146. [25] Suggest an alternative to ( 64 ) and ( 65 ) for use when Algorithm L has been 
extended to nonternary clauses as in exercise 143. (Strive for simplicity.) 

147. [05] Evaluate C max in ( 66 ) for d = 0, 1 ， 10, 20, 30, using the default Co and C\. 

148. [21] Equation ( 66 ) bounds the maximum number of candidates using a formula 
that depends on the current depth but not on the total number of free variables. 
The same cutoffs are used in problems with any number of variables. Why is that a 
reasonable strategy? 

► 149. [26] Devise a data structure that makes it convenient to tell whether a given 
variable x is a “participant” in Algorithm L. 

150. [21] Continue the text’s story of lookahead in waerden (3, 3; 9): What happens at 
depth d = 1 when l 7 and T 卜 22 (see ( 70 ))，after literal 4 has become proto true? 
(Assume that no double-lookahead is done.) 

► 151. [26] The dependency digraph ( 68 ) has 16 arcs, only 8 of which are captured in the 
subforest ( 69 )• Show that, instead of ( 70 )，we could actually list the literals l and give 
them offsets o{l) in such a way that u appears before v in the list and has o{u) > o(v) if 
and only if v — >u in ( 68 ) - Thus we could capture all 16 dependencies via levels of truth* 

152. [22] Give an instance of 3SAT for which no free “participants” are found in 
step X3, yet all clauses are satisfied. Also describe an efficient way to verify satisfaction. 

153. [17] What’s a good way to weed out unwanted candidates in step X3, if C >(7 max ? 

154. [20] Suppose we’re looking ahead with just four candidate variables ， {a ， 6 ， c ， d }， 
and that they’re related by three binary clauses (aV 6 ) A (aVc) A (cVd). Find a subforest 
and a sequence of truth levels to facilitate lookaheads, analogous to ( 69 ) and ( 70 ) • 
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7.2.2.2 


155. [32] Sketch an efficient way to construct the lookahead forest in step X4. 

156. [05] Why is a pure literal a special case of an autarky? 

157. [10] Give an example of an autarky that is not a pure literal. 

158. [15] If / is a pure literal, will Algorithm X discover it? 

159. [Ml 7] True or false: (a) A is an autarky for F if and only if F \ A C F. (b) If 
A is an autarky for F and A f C then A\ A f is an autarky for F\A f . 

160. [18] (Black and white principle.) Consider any rule by which literals have been 
colored white ， black, or gray in such a way that l is white if and only if l is black. (For 
example, we might say that l is white if it appears in fewer clauses than l.) 

a) Suppose every clause of F that contains a white literal also contains a black 
literal- Prove that F is satisfiable if and only if its all-gray clauses are satisfiable- 

b) Explain why this metaphor is another way to describe the notion of an autarky. 

► 161. [21] (Black and blue principle.) Now consider coloring literals either white ， black, 
orange, blue，or gray，in such a way that l is white if and only if l is black, and l is 
orange if and only if l is blue- (Hence l is gray if and only if l is gray.) Suppose further 
that F is a set of clauses in which every clause containing a white literal also contains 
either a black literal or a blue literal (or both). Let A = {ai, • • •, a p } be the black 
literals and let L = {Zi ， • • • ,G} be the blue literals- Also let F r be the set of clauses 
obtained by adding p additional clauses (『i V • • • V 匕 V aj) to F, for 1 < j < p. 

a) Prove that F is satisfiable if and only if F f is satisfiable. 

b) Restate and simplify that result in the case that p = 1. 

c) Restate and simplify that result in the case that q = 1. 

d) Restate and simplify that result in the case that p = q = 1. (In this special case, 

(l V a) is called a blocked binary clause.) 

162. [21] Devise an efficient way to discover all of the (a) blocked binary clauses (IWa) 
and (b) size-two autarkies A = {a, a’} of a given /cSAT problem F. 

► 163. [M25] Prove that the following recursive procedure R(F) will solve any n-variable 
3SAT problem F with at most O(0 n ) executions of steps Rl ， R2, or R3: 

Rl. [Check easy cases.] If _F = 0, return true. If 0 G F, return false. Otherwise let 
{h, …， G F be a clause of minimum size 5- 

R2. [Check autarkies.] If 5 = 1 or if {l s } is an autarky ， set F F\l s and return to Rl. 
Otherwise if {l s ^ l s -i} is an autarky, set F ^ F\ l s Js-i and return to Rl- 

R3. [Recurse.] If R(F \ l s ) is true，return true. Otherwise set F <— F \l s ^ s s — 1 ， 

and go back to R2. | 

164. [M30] Continuing exercise 163, bound the running time when F is kSAT. 

► 165. [26] Design an algorithm to find the largest positive autarky A for a given F, 
namely an autarky that contains only positive literals. Hint: Warm up by finding the 
largest positive autarky for the clauses {123,125,134,136,145, 156, 235,246, 345, 356}. 

166. [30] Justify the operations of step X9. Hint: Prove that an autarky can be con- 
stmcted，if w ； = 0 after ( 72 ) has been performed- 

► 167. [21] Justify step Xll and the similar use of X12 in step X 6 . 

168. [26] Suggest a way to choose the branch literal l in step L3, based on the heuristic 
scores H{1) that were compiled by Algorithm X in step L2. Hint: Experience shows 
that it’s good to have both H(l) and H{J) large- 
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SATISFIABILITY: EXERCISES 


147 


► 169. [HM30] (T\ Ahmed ， O. Kullmann.) Excellent results have been obtained in 
some problems when the branch variable in step L3 is chosen to minimize the quantity 
丁 、 where T(a ， 6 ) is the positive solution to r"" a + r~ b = 1. (For example, 

r(l,2) = (j> ^ 1.62 and r(V2,V2) = 2 1/v ^ ^ 1.63, so we prefer (1,2) to (V5,v^).) 
Given a list of pairs of positive numbers (ai ， 6 i), •… ， (a s ,b s ), what’s an efficient way 
to determine an index j that minimizes r(aj^bj)^ without computing logarithms? 

170. [25] (Marijn Heule, 2013.) Show that Algorithm L solves 2SAT in linear time. 

171. [20] What is the purpose of DFAIL in Algorithm Y? 

172. [21] Explain why ‘+L0[j] ’ appears in step Y2’s formula for DT. 

173. [40] Use an implementation of Algorithm L to experiment with random 3SAT 
problems such as rand (3, 2062, 500, 314). Examine the effects of such things as (i) dis¬ 
abling double lookahead; (ii) disabling “wraparound，” by changing the cases j = S and 
j = 5" in X7 and Y4 so that they simply go to X 6 and Y3; (iii) disabling the lookahead 
forest, by letting all candidate literals have null PARENT; (iv) disabling compensation 
resolvents in step L9; (v) disabling “windfalls” in ( 72 ); (vi) branching on a random free 
candidate l in L3, instead of using the H scores as in exercise 168; or (vii) disabling all 
lookahead entirely as in “Algorithm L 0 .” 

174* [15] What’s an easy way to accomplish (i) in the previous exercise? 

175. [32] When Algorithm L is extended to nonternary clauses as in exercise 143, how 
should Algorithms X and Y also change? (Instead of using ( 64 ) and ( 65 ) to compute a 
heuristic for preselection, use the much simpler formula in answer 146. And instead of 
using h{u)h{v) in ( 67 ) to estimate the weight of a ternary clause that will be reduced 
to binary, consider a simulated reduced clause of size 5 > 2 to have weight K s ^ 7 s — 2 , 
where 7 is a constant (typically 0 . 2 ).) 

176. [M25] The “flower snark^ J q is a cubic graph with Aq vertices tj , Uj, Vj, Wj , and 
edges tj —— Uj, uj — Vj，Uj — wj, vj — w j+ i, wj — v j+ i, for 1 < j < q, 

with subscripts treated modulo q. Here，for example, are J5 and its line graph L( J5): 



a) Give labels bj, Cj, dj, ej, and fj to the edges of J q , for 1 < j < g. (Thus aj 
denotes tj —— tj+i and bj denotes tj —— Uj, etc.) What are the edges of L{J q )l 

b) Show that x{Jq) = 2 and x(L(J^)) = 3 when q is even. 

c) Show that x{Jq) — 3 and x(L(J^)) = 4 when q is odd. Note: Let fsnark (q) denote 
the clauses (15) and (16) that correspond to 3-coloring L ( J q )^ together with 61,1 A 
ci, 2 Adi, 3 to set the colors of ( 61 ， ci ， di) to (1 ， 2,3). Also let fsnark f (q) be fsnark (q) 
augmented by (17). These clauses make excellent benchmark tests for SAT solvers- 

177. [HM26] Let I q be the number of independent sets of the flower snark line graph 
L{J q ). Compute I q for 1 < ^ < 8, and determine the asymptotic growth rate- 

► 178. [M23] When Algorithm B is presented with the unsatisfiable clauses fsnark (q) 
of exercise 176, with q odd, its speed depends critically on the ordering of the variables- 
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Show that the running time is 0(2^) when the variables are considered in the order 

ai, 1 ai,2ai,3^1,1 261,3C1,1 Cl,2Cl,3^1,lC?l,2c?l,361,161,2 61,3/1,1 /l,2/1,3^2,1 a*2, 2«2,3 …； 

but much, much more time is needed when the order is 

ai,161,ici,ic?i,iei,1/1,ia2,i&2,iC2,1^2,162,1/2,1 - . - a q ,ib q ,ic q ， id q ,ie q ,i ••… 

179. [25] Show that there are exactly 4380 ways to fill the 32 cells of the 5-cube with 
eight 4 - element subcubes. For example, one such way is to use the subcubes 000 **， 
001** ， … ， 111**, in the notation of 7.1.1—( 29 ); a more interesting way is to use 

0*0*0， 1*0*0， **001， **110， *010*， *110*， 0**11, 1**11. 


What does this fact tell you about the value of qg in Fig. 40? 


180. [25] Explain how to use BDDs to compute the numbers Q m that underlie Fig. 40- 
What is max 0 < m <80 Qm? 

181. [25] Extend the idea of the previous exercise so that it is possible to determine 
the probability distributions T m of Fig. 41. 

182. [Ml 6] For which values of m in Fig. 41 does T m have a constant value? 

Discuss the relation between Figs- 42 and 43. 

Why does ( 77 ) characterize the relation between q m and q m l 
Use ( 77 ) to prove the intuitively obvious fact that q m > q m . 

Use ( 77 ) to reduce q m and E m (2m + l)q m to ( 78 ) and ( 79 ). 


183 - [M30] 

184. [M20] 

185. [M20] 
186 - [M21] 


187. [M20] Analyze random satisfiability in the case k = n: What are S n , n and S n , n l 

188. [HM25] Analyze random 1SAT，the case k = 1: What are ^ 1 ^ and Si^ n l 

189. [27] Apply BDD methods to random 3SAT problems on 50 variables. What is the 
approximate BDD size after m distinct clauses have been ANDed together, as m grows? 


190. [M20] Exhibit a Boolean function of 4 variables that can’t be expressed in 3CNF. 
(No auxiliary variables are allowed: Only xi, a> 2 , X 3 , and may appear.) 


191. [M25] How many Boolean functions of 4 variables can be expressed in 3CNF? 

► 192. [HM21 ] Another way to model satisfiability when there are N equally likely 
clauses is to study S{p), the probability of satisfiability when each clause is indepen¬ 
dently present with probability p. 

a) Express S(p) in terms of the numbers Q m = 

b) Assign uniform random numbers in [0.. 1) to each clause; then at time ^ for 
0 < t < consider all clauses that have been assigned a number less than t/N. 
(Approximately t clauses will therefore be selected, when N is large.) Show that 
Sk,n = Sk, n (t/N) dt^ the expected amount of time during which the chosen 
clauses remain satisfiable, is very similar to the satisfiability threshold Sk,n of ( 76 ). 


193. [HM 48 ] Determine the satisfiability threshold ( 81 ) of random 3SAT. Is it true 
that liminf n _^oo S^.u/n = lim sup n _ >oc Sz^ n /nl If so, is the limit s 4.2667? 

194. [HM49] If a < liminfn-^oo Sz,n/n^ is there a polynomial-time algorithm that is 
able to satisfy |/m」random 3SAT clauses with probability > for some ^ > 0? 


195. [HM21] (J- Franco and M. Pauli 5 1983.) Use the first moment principle MPR—( 21 ) 
to prove that [( 2 & 1112 )? 2 」 random /cSAT clauses are almost always unsatisfiable- Hint: 
Let X = [x satisfies all clauses], summed over all 2 n binary vectors x = xi ... x n . 
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SATISFIABILITY: EXERCISES 


149 


► 196. [HM25] (D. B. Wilson.) A clause of a satisfiability problem is “easy” if it 

contains one or more variables that don’t appear in any other clauses. Prove that, 

with probability 1 — 0(n _2e )，a /cSAT problem that has m = Lan」random clauses 
contains (1 — (1 — e~ ka ) k )m-\- 0 (n" 2+e ) easy ones. (For example, about 0.000035n of 
the 4.27n clauses in a random 3SAT problem near the threshold will be easy.) 

197. [HM21] Prove that the quotient g(a, 6, A, B, n) = ( ( H n )/((Ud )n ) 

is 0 (n - V 2 ) as n —> oo, if a ， b^A^B > 0 . 

► 198. [HM30] Use exercises 196 and 197 to show that the phase transition in Fig. 46 is 

not extremely abrupt: If Sz{m^n) > | and Sz{m ^n) < prove that m = m + 

199. [M21] Let p{t^ m, N) be the probability that t specified letters each occur at least 
once within a random m - letter word on an TV - letter alphabet. 

a) Prove that N) < /N l . 

b) Derive the exact formula p(t, m, N) = (^)( —1)^(^ — k) m /N m . 

c) And 池 iV)A! = {;}(▽)〆- {Y}GTi )/^ +1 + (T)GT 2 )/^ +2 _ . 

► 200. [M21] Complete the text’s proof of ( 84 ) when c < 1: 

a) Show that every unsatisfiable 2SAT formula contains clauses of a snare- 

b) Conversely, are the clauses of a snare always unsatisfiable? 

c) Verify the inequality ( 8 g). Hint: See exercise 199. 

201. [HM29] The t-snake clauses specified by a chain (/i,..., ht-i) can be written 
(Ji V U+i) for 0 < i < 2^ where lo = It and subscripts are treated mod 2t. 

a) Describe all ways to set two of the Vs so that {x\ V X 2 ) is one of those 2t clauses. 

b) Similarly, set three Vs in order to obtain (xi V X 2 ) and (X 2 V ^ 3 )- 

c) Also set three to obtain both (xo V x\) and (xt-i V xt); here 三 and t > 2. 

d) How can the clauses (xiW x^i) for 0 < i < t all be obtained by setting t of the Vs? 

e) In general 5 let iV(g ， r) be the number of ways to choose r of the standard clauses 

(xi V which involve exactly q of the variables {xi，• • •, X 2 t-i}^ and to set q 

values of {Zi ， … ， l 2 t-i} in order to obtain the r chosen clauses- Evaluate N(2, 1). 

f) Similarly，evaluate iV(3,2 )， N(t ， t\ and N(2t — 1, 2t). 

g) Show that the probability p r in ( 95 ) is < N(q^r)/ 

h) Therefore the upper bound ( 96 ) is valid. 

202. [HM21 ] This exercise amplifies the text’s proof of Theorem C when c > 1. 

a) Explain the right-hand side of Eq. ( 93 ). 

b) Why does ( 97 ) follow from ( 95 ) ，（ 96 )，and the stated choices of t and ml 

► 203. [HM33] (K. Xu and W. Li, 2000.) Beginning with the n graph-coloring clauses 
( 15 ), and optionally the n ⑵ exclusion clauses ( 17 ), consider using randomly generated 
binary clauses instead of ( 16 ) • There are mq random binary clauses, obtained as m 
independent sets of q clauses each, where every such set is selected by choosing distinct 
vertices u and then choosing q distinct binary clauses (ui V Vj) for I < i^j < d. 
(The number of different possible sequences of random clauses is therefore exactly 
(( 2 ) ( d q)) m an d each sequence is equally likely.) This method of clause generation is 
known as “Model RB ”； it generalizes random 2SAT, which is the case d = 2 and g = 1- 

Suppose d = n a and q = pd 2 ^ where we require * < a < 1 and 0 < p < |. 
Also let m = rn In d. For this range of the parameters, we will prove that there is 
a sharp threshold of satisfiability: The clauses are unsatisfiable q.s., as n — 00 , if 
r ln(l — p) + 1 < 0 ; but they are satisfiable a-s. if r ln(l — p) + 1 > 0 . 


Wilson 
easy clauses 

bystanders, see easy clauses 

binomial coefficients 

phase transition 

sharp threshold 

random word 

Stirling subset numbers 

2SAT 

snare 

t-snake 

Xu 

Li 

exclusion clauses 

at-most-one 

Model RB 

random 2SAT 

threshold of satisfiability 

q.s. 


September 23, 2015 



150 COMBINATORIAL ALGORITHMS (F6A: 23 Sep 2015 @ 1959) 


7.2.2.2 


Let X(ji, •. • ， j n ) = [all clauses are satisfied when each ith variable v has Vj { = 1]; 
here 1 乞 ji，• • • ， jn < d. Also let X = X(ji，• • • ， j n ). Then X = 0 if and 

only if the clauses are unsatisfiable. 

a) Use the first moment principle to prove that X = 0 q.s. when r ln(l — p) + 1 < 0. 

b) Find a formula for p s = Pr(X(ji, • • •, j n ) = 1 | X(l,. .. 5 1) = 1), given that 
exactly s of the colors {ji，•••，▲} are equal to 1 . 

c) Use (b) and the conditional expectation inequality MPR—( 24 ) to prove that X > 0 
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d) Letting t s denote the term for s in that sum, prove that t s » 1. 

e) Suppose r ln(l — p) + 1 = e > 0, where e is small. Show that the terms t s first 
increase, then decrease, then increase, then decrease again, as s grows from 0 
to n. Hint: Consider the ratio x s = 

f) Finally，prove that t s is exponentially small for 3n/d < s < n. 


► 204. [28] Figure 46 might suggest that 3SAT problems on n variables are always easy 
when there are fewer than 2n clauses. We shall prove, however, that any set of m 
ternary clauses on n variables can be transformed mechanically into another set of 
ternary clauses on TV = 0{m) variables in which no variable occurs more than four 
times. The transformed problem is satisfiable if and only if the original problem was; 
thus it isn’t any simpler，although (with at most AN literals) it has at most |iV clauses. 

a) First replace the original m clauses by m new clauses (Xi VX 2 VX 3 ) ， ••” （為爪 —2 V 
X^m-i V X 3 m ), on 3m new variables, and show how to add 3m clauses of size 2 
so that the resulting 4m clauses have exactly as many solutions as the original- 

b) Construct 16 unsatisfiable ternary clauses on 15 variables，where each variable 
occurs at most four times. Hint: If F and F f are sets of clauses, let FUF r stand 
for any other set obtained from FUF’ by replacing one or more clauses (7 of F by 
xUC and one or more clauses C f of F f by xl)C f ^ where x is a new variable; then 
FUF r is unsatisfiable whenever F and F f are both unsatisfiable. For example, if 
F= {e} and F' = {1,1}, then FUF f is either {2,12,12} or {2,1,12} or {2,12,1}. 

c) Remove one of the clauses from solution (b) and find all solutions of the 15 clauses 
that remain. (At least three of the variables will have forced values.) 

d) Use (a), (b), and (c) to prove the TV - variable result claimed above- 

205. [26] Construct an unsatisfiable 4SAT problem in which every variable occurs at 
most 5 times. Hint: Use the U operation as in the previous exercise. 


206. [M22] A set of clauses is minimally unsatisfiable if it is unsatisfiable, yet becomes 
satisfiable if any clause is deleted. Show that, if F and F f have no variables in common, 
then FUF 1 is minimally unsatisfiable if and only if F and F f are minimally unsatisfiable. 

207. [25] Each of the literals {1 ， i ， 2, 3, 3,4, occurs exactly thrice in the eight 

unsatisfiable clauses ( 6 ) - Construct an unsatisfiable 3SAT problem with 15 variables in 
which each of the 30 literals occurs exactly twice. Hint: Consider {12, 23, 31, 123, 123}. 

208. [25] Via exercises 204(a) and 207， show that any 3SAT problem can be trans¬ 
formed into an equivalent set of ternary clauses where every literal occurs just twice* 


209. [25] (C. A. Tovey.) Prove that every /cSAT formula in which no variable occurs 
more than k times is satisfiable. (Thus the limits on occurrences in exercises 204-208 
cannot be lowered，when k = 3 and k = 4.) Hint: Use the theory of bipartite matching. 


first moment principle 
conditional expectation inequality 
4SAT 

minimally unsatisfiable 
clauses per literal 
Tovey 

bipartite matching 
matching 


September 23, 2015 



7.2.2.2 SATISFIABILITY: EXERCISES 151 

210. [M36] But the result in the previous exercise can be improved when k is large- 
Use the Local Lemma to show that every 7SAT problem with at most 13 occurrences 
of each variable is satisfiable. 

211. [30] (R. W- Irving and M. Jerrum ， 1994.) Use exercise 208 to reduce 3SAT to the 
problem of list coloring a grid graph of the form Kn □ K 3 . (Hence the latter problem, 
which is also called latin rectangle construction^ is NP-complete.) 

212. [32] Continuing the previous exercise, we shall reduce grid list coloring to another 
interesting problem called partial latin square construction. Given three n x n binary 
matrices {rik), (Cjk), (Pij), the task is to construct an n x n array (Xij ) such that Xij 
is blank when pij = 0 , otherwise Xij = k for some k with = Cjk = 1 ； furthermore 
the nonblank entries must be distinct in each row and column. 

a) Show that this problem is symmetrical in all three coordinates: It’s equivalent to 
constructing a binary n x n x n tensor (xijk) such that x^jk = Cj/c, Xi^k = ”i/c ， 
and Xij* = pij^ for 1 < i, j, A: < n, where denotes summing an index from 1 
to n. (Therefore it is also known as the binary n x n x n contingency problem, 
given n 2 row sums, n 2 column sums, and n 2 pile sums.) 

b) A necessary condition for solution is that c^k = = p 小 and 

Exhibit a small example where this condition is not sufficient. 

c) If M < N, reduce KmUKn list coloring to the problem of Kn uKn list coloring. 

d) Finally，explain how to reduce KnUKn list coloring to the problem of constructing 

an n x n partial latin square, where n = TV + j |L (/ 5 J)\. Hint: Instead of 
considering integers 1 < A: < n, let i ， j ， A: range over a set of n elements. 
Define pij = 0 for most values of i and j; also make for all i and k. 

► 213. [M20] Experience with the analyses of sorting algorithms in Chapter 5 suggests 
that random satisfiability problems might be modeled nicely if we assume that, in each 
of m independent clauses，the literals Xj and Xj occur with respective probabilities p 
and independently for 1 < j < n, where p + ^ < 1 - Why is this not an interesting 
model as n 4 00 , when p and q are constant? Hint: What is the probability that 
xi ... x n = bi .. .b n satisfies all of the clauses, when b\ .. .b n is a given binary vector? 

214. [HM38] Although the random model in the preceding exercise doesn’t teach us 
how to solve SAT problems, it does lead to interesting mathematics: Let 0 < p < 1 and 
consider the recurrence 


T o = 0; 



for n > 0. 


a) Find a functional relation satisfied by T{z) = T n z n /n\. 

b) Deduce that we have T(z) = ze z ^^ = 0 (2p) m (1 — 

c) Hence，if p 1/2, we can use Mellin transforms (as in the derivation of 5.2.2—( 50 )) 
to show that T n = C p n a (l + S(n) + 0(l/n)) + n/(l — 2p), where a = l/lg(l/p), 
C p is a constant，and ^ is a small “wobble” with S(n) = S(pn). 

► 215. [HM23] What is the expected profile of the search tree when a simple backtrack 
procedure is used to find all solutions to a random 3SAT problem with m independent 
clauses on n variables? (There is a node on level l for every partial solution x\ .. .xi 
that doesn’t contradict any of the clauses.) Compute these values when m = 200 and 
n = 50. Also estimate the total tree size when m = an, for fixed a as n —)• 00 . 

216. [HM38] (R W. Purdom ， Jr” and C. A. Brown.) Extend the previous exercise to 
a more sophisticated kind of backtracking, where all choices forced by unit clauses are 
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pursued before two-way branching is done. (The “pure literal rule” is not exploited, 
however, because it doesn’t find all solutions.) Prove that the expected tree size is 
greatly reduced when m = 200 and n = 50. (An upper bound is sufficient-) 

217. [20] True or false: If A and B are arbitrary clauses that are simultaneously 
satisfiable, and if l is any literal, then the clause C = (A\J B)\ {/, 1} is also satisfiable. 
(We’re thinking here of A, and C as sets of literals，not as disjunctions of literals.) 

218. [20] Express the formula (xVA) A(xVJ5) in terms of the ternary operator u? v:w. 

► 219. [M20] Formulate a general definition of the resolution operator C = C r oC n that 
(i) agrees with the text’s definition when C f = x\/ A f and C n = x VA"; (ii) applies to 
arbitrary clauses C r and (7"; (iii) has the property that C r A C" implies C r o C" • 

220. [M24] We say that clause C subsumes clause C\ written C C C f ^ if C f = p or if 
C f ^ p and every literal of C appears in C f • 

a) True or false: C C C' and C' C C" implies C C C". 

b) True or false: ((7 V a) o {C r V a r ) C ((7 o C r ) V a V a’，with o as in exercise 219. 

c) True or false: C' C C" implies CoC' C CoC". 

d) The notation (7i” •” C m C means that a resolution chain (7i，• • • ， C m + r exists 
with C m + r C for some r > 0 - Show that we might have (7i ， …， C m h C even 
though C cannot be obtained from {(7i，• •. ， C m } by successive resolutions ( 104 ). 

e) Prove that if C\ C Cj, ..., C m C C' m , and ,..., C' m h (7, then Ci,..., C m h C. 

f) Furthermore (7i ，…， h (7 implies (7i V ai ， … ， （ 7 m V a m 卜 （7 V ai V • • • V a m . 

221. [16] Draw the search tree analogous to Fig. 38 that is implicitly traversed when 
Algorithm A is applied to the unsatisfiable clauses {12, 2, 2}. Explain why it does not 
correspond to a resolution refutation that is analogous to Fig. 48. 

222. [M30] (Oliver Kullmann ， 2000.) Prove that，for every clause (7 in a satisfiability 
problem F, there is an autarky satisfying C if and only if C cannot be used as the label 
of a source vertex in any resolution refutation of F. 

223. [HM40] Step X9 deduces a binary clause that cannot be derived by resolution 
(see exercise 166)• Prove that, nevertheless, the running time of Algorithm L on un¬ 
satisfiable input will never be less than the length of a shortest treelike refutation. 

224. [M20] Given a resolution tree that refutes the axioms F\x^ show how to construct 
a resolution tree of the same size that either refutes the axioms F or derives the clause 
{x} from F without resolving on the variable x. 

► 225. [M31] (G. S- Tseytin ， 1966.) If T is any resolution tree that refutes a set of 
axioms show how to convert it to a regular resolution tree T r that refutes F, where 
T r is no larger than T. 

226. [M20] If a is a node in a refutation tree，let C(a) be its label，and let ||a|| denote 
the number of leaves in its subtree. Show that，given a refutation tree with N leaves, 
the Prover can find a node with ||a|| < N/2 S for which the current assignment falsifies 
(7(a)，whenever the Delayer has scored s points in the Prover—Delayer game. 

227. [M27] Given an extended binary tree, exercise 7-2.1.6-124 explains how to label 
each node with its Horton-Strahler number. For example, the nodes at depth 2 in 
Fig. 48 are labeled 1, because their children have the labels 1 and 0; the root is labeled 3. 

Prove that the maximum score that the Delayer can guarantee, when playing 
the Prover—Delayer game for a set of unsatisfiable clauses is equal to the minimum 
possible Horton-Strahler root label in a tree refutation of F. 
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► 228. [M21] Stalmarck^s refutation of ( 99 )-( 101 ) actually obtains e without using all of 
the axioms! Show that only about 1/3 of those clauses are sufficient for unsatisfiability. 

► 229. [M21] Continuing exercise 228， prove also that the set of clauses ( 99 ) ， ( 10 c /)， 
( 101 ) is unsatisfiable，where (ioo 7 ) denotes ( 100 ) restricted to the cases i < k and j < k. 

230. [M22] Show that the clauses with i ^ j in the previous exercise form a minimal 
unsatisfiable set: Removing any one of them leaves a satisfiable remainder. 

231 • [M30] (Sam Buss.) Refute the clauses of exercise 229 with a resolution chain of 
length 0(m 3 ). Hint: Derive the clauses Gij = (a^jVa^(j+i)V. • for 1 < i < j < m- 

► 232. [M28] Prove that the clauses fsnark(q) of exercise 176 can be refuted by treelike 
resolution in 0 (q 6 ) steps. 

233. [16] Explain why ( 105 ) satisfies ( 104 )，by exhibiting j(i) and k(i) for 9 < i < 22. 

234. [20] Show that the Delayer can score at least m points against any Prover who 
tries to refute the pigeonhole clauses ( 106 ) and ( 107 ). 

► 235. [30] Refute those pigeonhole clauses with a chain of length m(m+3)2 m_2 . 

236 • [48] Is the chain in the previous exercise as short as possible? 

► 237. [28] Show that a polynomial number of steps suffice to refute the pigeonhole 
clauses ( 106 ), ( 107 ), if the extended resolution trick is used to append new clauses. 

238. [HM21 ] Complete the proof of Lemma B. Hint: Make r < p- b when W = b. 

► 239. [M21] What clauses ao on n variables make ||ao 卜 e|| as large as possible? 

► 240. [HM23] Choose integers fij G {1 ， … ， m} uniformly at random，for 1 < i < 5 
and 0 < j < m, and let Go be the bipartite graph with edges aj —— bk if and only if A: G 
{/ij， ••” f^j }• Show that Pr(Go satisfies the strong expansion condition ( 108 )) > 1/2. 

241 • [20] Prove that any set of at most m/3000 pigeons can be matched to distinct 
holes, under the restricted pigeonhole constraints Go of Theorem B. 

242. [M20] The pigeonhole axioms ( 106 ) and ( 107 ) are equivalent to the clauses ( 15 ) 
and ( 16 ) that arise if we try to color the complete graph K m +i with m colors- 

Suppose we include further axioms corresponding to ( 17 )，namely 

(xjk V Xjk 1 )^ for 0 < j < m and 1 < k < k f < m. 

Does Theorem B still hold，or do these additional axioms decrease the refutation width? 

243. [HM31 ] (E. Ben-Sasson and A. Wigderson*) Let F be a set of \_an\ random 
3SAT clauses on n variables, where a > 1/e is a given constant. For any clause C on 
those variables, define /^(C) = min{ \F f \ \ F f C F and F' h (7). Also let V{F') denote 
the variables that occur in a given family of clauses F f . 

a) Prove that \V(F f )\ > \F f \ a.s., when F f C F and |^| < n/(2ae 2 ). 

b) Therefore either F is satisfiable or > n/(2ae 2 ) ， a.s. 

c) Let n = n/(1000000a 4 ), and assume that n >2. Prove that 2\V(F r )\ — 3\F r \ > 
n /A q.s., when F f C F and n /2 < \F f \ <n . 

d) Consequently either F is satisfiable or w(F \~ e) > n’/4, a.s. 

244. [M20] If A is a set of variables, let [A] 0 or [A] 1 stand for the set of all clauses 
that can be formed from A with an even or odd number of negative literals ， respec¬ 
tively; each clause should involve all of the variables. (For example ， [{1 5 2,3 }] 1 = 
{123,123,123,123}.) If A and B are disjoint，express [A U B]° in terms of the sets 
㈤ 0, [A] 1 , [B]°, [B}\ 
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► 245. [M27] Let (7 be a connected graph whose vertices v ^ V have each been labeled 
0 or 1， where the sum of all labels is odd. We will construct clauses on the set of 
variables e ㈣， one for each edge u —— v in G. The axioms are a{v) = [E(v)]’ ㈤㊉ 1 for 
each v ^ V (see exercise 244), where E{v) = {e uv \ u —— v} and l(v) is the label of v. 

For example, vertex 1 of the graph below is shown as a black dot in order to 
indicate that Z(l) = 1， while the other vertices appear as white dots and are labeled 
1(2) = … =1(6) =0. The graph and its axioms are 

a(l) = {af ， df }， a(4) = {cd^ cd}^ 

a(2) = {ab&, abg, dbg, db&} ， a(5) = {deh, d 巨 h, deh, d 巨 h}, 
a(3) = {bch^ bch^ bch^ bch}^ a(6) = {efg^ efg^ efg^ efg}. 

Notice that，when v has d > 0 neighbors in the set a{v) consists of 2 d ~~ 1 clauses of 
size d. Furthermore, the axioms of a{v) are all satisfied if and only if 

^UV — • 

^uv 

If we sum this equation over all vertices mod 2, we get 0 on the left，because each 
edge e uv occurs exactly twice (once in E(u) and once in E(v)) • But we get 1 on the 
right. Therefore the clauses a(G) = IJ^ a{v) are unsatisfiable. 

a) The axioms a{G) \ b and a(G) \ b in this example turn out to be a{G r ) and a{G n )^ 

where G r = and G" = Explain what happens in general. 

b) Let fi(C) = min{ \V'\ \V' <ZV and LLev a ( v ) 卜 C}, for every clause C involv¬ 
ing the variables e uv . Show that fi(C) = 1 for every axiom C G a(G). What is "(e)? 

c) If V f C y, let dv' = {e U v \ U e V v V'}. Prove that, if |F’| = p(C), 
every variable of dV f appears in C. 

d) A nonbipartite cubic Ramanujan graph G on m vertices V has three edges v —— vp, 

v —— vg) v —— vr touching each vertex, where p, a, and r are permutations with 
the following properties: (i) p = p 一 and r = ; (ii) G is connected; (iii) If V f is 

any subset of s vertices, and if there are t edges between V f and V\V\ then we 
have s/(s + t) < (s/n + 8)/9. Prove that w{a{G) h e) > m/78. 

► 246. [M28] (G- S- Tseytin.) Given a labeled graph G with m edges, n vertices，and 
N unsatisfiable clauses a{G) as in the previous exercise, explain how to refute those 
clauses with 0(mn + N) steps of extended resolution. 

247. [18] Apply variable elimination to just five of the six clauses ( 112 ), omitting c 12\ 

248. [M20] Formally speaking, SAT is the problem of evaluating the quantified for¬ 
mula 

3xi ••• 3x^—1 3x^2 F (xi 5 •. • 5 Xri— 1 •) Xn ) •) 

where F is a Boolean function given in CNF as a conjunction of clauses- Explain how 
to transform the CNF for F into the CNF for F r in the reduced problem 

彐 Xl … 彐 一 iF’(Xi ，…， X n -1) ， F’(X1 ，…， X n -1) = F{xi^. .. ^x n -i^)y F{xi^. . . ^X n -I^l). 
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249. [18] Apply Algorithm I to ( 112 ) using Cook’s Method IA. 

250. [25] Since the clauses R f in ( 7 ) are satisfiable, Algorithm I might discover a 
solution without ever reaching step 14. Try ， however, to make the choices in steps 12, 
13, and 14 so that the algorithm takes as long as possible to discover a solution. 
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► 251. [30] Show that Algorithm I can prove the unsatisfiability of the anti-maximal- 
element clauses ( 99 )-( 101 ) by making 0 (m 3 ) resolutions，if suitably clairvoyant choices 
are made in steps 12, 13, and 14. 

252. [M26] Can the unsatisfiability of ( 99 )-( 101 ) be proved in polynomial time by 
repeatedly performing variable elimination and subsumption? 

► 253. [18] What are the next two clauses learned if decision ‘5’ follows next after ( 114 )? 

254. [16] Given the binary clauses {12,13, 23,24, 34}，what clause will a CDCL solver 
learn first if it begins by deciding that 1 is true? 

► 255. [20] Construct a satisfiability problem with ternary clauses, for which a CDCL 
solver that is started with decision literals c l\ ‘2% ‘3’ on levels 1 ， 2， and 3 will learn 
the clause ‘45’ after a conflict on level 3- 

256. [20] How might the clause in Table 3 have been easily learned? 

► 257. [30] (Niklas Sorensson.) A literal l is said to be redundant, with respect to a given 
clause c and the current trail, if l is in the trail and either (i) l is defined at level 0 , or (ii) l 
is not a decision literal and every false literal in Vs reason is either in c or (recursively) 
redundant- (This definition is stronger than the special cases by which ( 115 ) reduces 
to ( 116 )，because l itself needn’t belong to c.) If, for example, c = (P V 61 V 62 V 63 V 64 )， 
let the reason for 64 be (64 V 61 V ai), where the reason for a 1 is (a 1 V 62 V 0 , 2 ) and the 
reason for a 2 is (a ‘2 V 61 V 63 ) - Then 64 is redundant, because a 2 and di are redundant. 

a) Suppose c = (P V 61 V • • • V b r ) is a newly learned clause. Prove that if bj G c is 
redundant, some other 6 ^ G c became false on the same level of the trail as bj did- 

b) Devise an efficient algorithm that discovers all of the redundant literals bi in a 
given newly learned clause c = (P V 61 V • • • V 6 r ). Hint: Use stamps. 

258. [21] A non-decision literal l in Algorithm C’s trail always has a reason Ri = 
(/o V /1 V • • • V Z/c 一 1 )，where lo = l and Zi ， • • • ， lk—i precede l in the trail- Furthermore, 
the algorithm discovered this clause while looking at the watch list of h. True or false: 
『 2 , • • • ， precede h in the trail Hint: Consider Table 3 and its sequel. 

259. [M20] Can ACT(j) exceed ACT (AO for values of p near 0 or 1， but not for all pi 

260. [18] Describe in detail step Cl’s setting-up of MEM，the watch lists，and the trail. 

261. [21] The main loop of Algorithm C is the unit-propagation process of steps C3 
and C4. Describe the low-level details of link adjustment, etc., to be done in those steps. 

262. [20] What low-level operations underlie changes to the heap in steps C 6 -C 8 ? 

263. [21] Write out the gory details by which step C7 constructs a new clause and 
step C9 puts it into the data structures of Algorithm C. 

264. [20] Suggest a way by which Algorithm C could indicate progress by displaying 
“move codes” analogous to those of Algorithms A, B, D，and L. (See exercise 142.) 

265. [21] Describe several circumstances in which the watched literals lo and/or h of 
a clause c actually become false during the execution of Algorithm C- 

266. [20] In order to keep from getting into a rut，CDCL solvers are often designed to 
make decisions at random, with a small probability p (say p = . 02 ), instead of always 
choosing a variable of maximum activity. How would this policy change step C 6 ? 

► 267. [25] Instances of SAT often contain numerous binary clauses, which are handled 
efficiently by the unit-propagation loop ( 62 ) of Algorithm L but not by the correspond¬ 
ing loop in step C3 of Algorithm C. (The technique of watched literals is great for long 
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clauses, but it is comparatively cumbersome for short ones.) What additional data 
structures will speed up Algorithm C’s inner loop, when binary clauses are abundant? 

268. [21] When Algorithm C makes a literal false at level 0 of the trail，we can remove 
it from all of the clauses. Such updating might take a long time, if we did it “eagerly ”； 
but there’s a lazy way out: We can delete a permanently false literal if we happen to 
encounter it in step C3 while looking for a new literal to watch (see exercise 261). 

Explain how to adapt the MEM data structure conventions so that such deletions 
can be done in situ, without copying clauses from one location into another. 

269 • [23] Suppose Algorithm C reaches a conflict at level d of the trail，after having 
chosen the decision literals m, 乜 2 , ••” Ud. Then the “trivial clause” (V W uiW - -W u^) 
must be true if the given clauses are satisfiable, where l f and d! are defined in step C7. 

a) Show that，if we start with the clause (f V 61 V • • • V b r ) that is obtained in 
step C7 and then resolve it somehow with zero or more known clauses, we can 
always reach a clause that subsumes the trivial clause. 

b) Sometimes, as in ( 115 )，the clause that is slated to be learned in step C9 is much 
longer than the trivial clause. Construct an example in which d = 3, d’ = 1, and 
r = 10, yet none of 61 ，…， h r are redundant in the sense of exercise 257. 

c) Suggest a way to improve Algorithm C accordingly. 

270. [25] (On-the-fly subsumption.) The intermediate clauses that arise in step C7, 
immediately after resolving with a reason Ri, occasionally turn out to be equal to the 
shorter clause Ri\l. In such cases we have an opportunity to strengthen that clause 
by deleting l from it, thus making it potentially more useful in the future. 

a) Construct an example where two clauses can each be subsumed in this way while 
resolving a single conflict. The subsumed clauses should both contain two literals 
assigned at the current level in the trail, as well as one literal from a lower level. 

b) Show that it’s easy to recognize such opportunities, and to strengthen such clauses 
efficiently, by modifying the steps of answer 263- 

► 271. [25] The sequence of learned clauses Ci ， C 2 , •… often includes cases where Ci 
subsumes its immediate predecessor, Ci—i. In such cases we might as well discard 
Ci—i, which appears at the very end of MEM, and store Ci in its place, unless Ci-i is 
still in use as a reason for some literal on the trail. (For example, more than 8,600 
of the 52,000 clauses typically learned from waerden(3^ 10; 97) by Algorithm C can be 
discarded in this way- Such discards are different from the on-the-fly subsumptions 
considered in exercise 270， because the subsumed Ci 一 1 includes only one literal from 
its original conflict level; furthermore, learned clauses have usually been significantly 
simplified by the procedure of exercise 257, unless they’re trivial.) 

Design an efficient way to discover when Ci-i can be safely discarded. 

272. [30] Experiment with the following idea: The clauses of waerden (j, k] n) are 
symmetrical under reflection 5 in the sense that they remain unchanged overall if we 
replace Xk by = x n +i-k for 1 < k < n. Therefore, whenever Algorithm C learns 
a clause C = (『’ V 61 V • • • V 6 r )，it is also entitled to learn the reflected clause C R = 
([ /i? V 6 f V---V 6 f). 

273. [27] A clause C that is learned from waerden (j, hr, n) is valid also with respect 
to waerden [j, k; n) when n > n\ and so are the clauses C + i that are obtained by 
adding i to each literal of for 1 < i < n — n. For example, the fact that ‘35’ follows 
from waerden (3, 3; 7) allows us to add the clauses 35, 46， 57 to waerden (3, 3; 9). 

a) Exploit this idea to speed up the calculation of van der Waerden numbers- 
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b) Explain how to apply it also to bounded model checking. 

274. [35] Algorithm C sets the “reason” for a literal l as soon as it notices a clause 
that forces l to be true- Later on, other clauses that force l are often encountered, in 
practice; but Algorithm C ignores them, even though one of them might be a “better 
reason.” (For example, another forcing clause might be significantly shorter.) Explore 
a modification of Algorithm C that tries to improve the reasons of non-decision literals. 

► 275. [22] Adapt Algorithm C to the problem of finding the lexicographically smallest 
solution to a satisfiability problem, by incorporating the ideas of exercise 109- 

276. [Ml5] True or false: If F is a family of clauses and L is a set of strictly distinct 
literals, then F A L hi e if and only if (F\L) hi e. 

277. [Ml8] If ((7i, … , Ct) is a certificate of unsatisfiability for F, and if all clauses 
of F have length > 2, prove that some C{ is a unit clause. 

278. [22] Find a six-step certificate of unsatisfiability for waerden (3, 3; 9). 

279. [M20] True or false: Every unsatisfiable 2 SAT problem has a certificate ‘(Z ， e)’. 

► 280. [M26] The problem cook(j,k) consists of all (p positive j-clauses and all (^) 
negative A:-clauses on {1 ， •…, n}，where n = j + k — 1. For example, cook (2^ 3) is 

{12, 13, 14, 23, 24, 34, 123, 124, 134, 234}. 

a) Why are these clauses obviously unsatisfiable? 

b) Find a totally positive certificate for cook (j, k), of length (】：;)• 

c) Prove in fact that Algorithm C always learns exactly clauses when it proves 

the unsatisfiability of cook[j, A:)，if M p = Mf = oo (no purging or flushing). 

281. [21] Construct a certificate of unsatisfiability that refutes ( 99 ) ，（ 100 ) ， ( 101 )- 

► 282. [MSS] Construct a certificate of unsatisfiability for the clauses fsnark(q) of exer¬ 

cise 176 when g > 3 is odd，using O(q) clauses, all having length < 4. Hint: Include the 
clauses ( 句， P V ^， P )， ( 句， P V/j>) ， fe ， P V/j， p )，and fj^ p ) for 1 < j < q, 1 < p < 3. 

283. [HM 46 ] Does Algorithm C solve the flower snark problem in linear time? More 
precisely, let p q (M) be the probability that the algorithm refutes fsnark (q) while mak¬ 
ing at most M references to MEM. Is there a constant N such that p q (Nq) > | for all ql 

284. [23] Given F and ((7i，• • • ， G)，a certificate-checking program tests condition 
( 119 ) by verifying that F and clauses (7i ，•••， Ci 一 1 will force a conflict when they 
are augmented by the unit literals of Ci. While doing this，it can mark each clause 
of F U {Ci, …， Ci-i} that was reduced to a unit during the forcing process; then the 
truth of Ci does not depend on the truth of any unmarked clause. 

In practice, many clauses of F are never marked at all, hence F will remain 
unsatisfiable even if we leave them out- Furthermore, many clauses Ci are not marked 
during the verification of any of their successors 5 {G+i ，... ^Ct}] such clauses Ci needn’t 
be verified, nor need we mark any of the clauses on which they depend- 

Therefore we can save work by checking the certificate backwards: Start by 
marking the final clause Ct, which is e and always needs to be verified- Then，for 
i = t, t — 1, • 一， check Ci only if it has been marked. 

The unit propagations can all be done without recording the “reason” _R/ that 
has caused any literal l to be forced- In practice, however, many of the forced literals 
don’t actually contribute to the conflicts that arise，and we don’t want to mark any 
clauses that aren’t really involved. 

Explain how to use reasons, as in Algorithm C, so that clauses are marked by the 
verifier only if they actually participate in the proof of a marked clause Ci. 
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285. [19] Using the data in Fig. 50， the text observes that Eq. ( 124 ) gives j = 95, 

Sj = 3081, and rrij = 59 when a = What are j, Sj, and rrij when (a) a = 吾？ 
(b) a = (c) a = jq? Also compare the effectiveness of different a’s by computing the 

number bj of “black” clauses (those with 0 < RANGE(c) < j that proved to be useful)- 

286. [M24] What choice of signatures-to-keep in Fig. 50 is optimum，in the sense that 
it maximizes ^2b pq x pq subject to the conditions ^ a pq x pq < 3114 ， x pq G {0 ， 1}, and 
x pq > x p f q f for 1 < p < p 7 < 7, 0 < ^ < ^ < 8 ? Here a pq and b pq are the areas of the 
gray and black clauses that have signature (p ， g), as given by the matrices in the text- 
[This is a special case of the “knapsack problem with a partial ordering.”] 

287. [25] What changes to Algorithm C are necessary to make it do a “full run，” and 
later to learn from all of the conflicts that arose during that run? 

288. [28] Spell out the details of computing RANGE scores and then compressing the 
database of learned clauses, during a round of purging. 

289. [M20] Assume that the kth round of purging begins with yk clauses in memory 
after kA + O clauses have been learned, and that purging removes ^yk of those 
clauses. Find a closed formula for yk as a function of k. 

290 . [17] Explain how to find Xk, the unassigned variable of maximum activity that 
is used for flushing literals. Hint: It’s in the HEAP array. 

291. [20] In the text’s hypothetical scenario about flushing Table 3 back to level 15, 
why will 49 soon appear on the trail，instead of 49? 

292. [M21] How large can AGILITY get after repeatedly executing ( 127 )? 

293. [21] Spell out the details of updating Mf to M+Af when deciding whether or not 
to flush. Also compute the agility threshold that’s specified in Table 4. Hint: See ( 131 ). 

294. [HM21 ] For each binary vector a = X 1 X 2 X 3 T 4 ，find the generating function 
9 a{z) = 5^7=0 P a ^ jZ^ where p a j is the probability that Algorithm P will solve the 

J 

seven clauses of ( 7 ) after making exactly j flips, given the initial values a in step PI- 
Deduce the mean and variance of the number of steps needed to find a solution. 

295. [M23] Algorithm P often finds solutions much more quickly than predicted by 
Corollary W. But show that some 3SAT clauses will indeed require ^((4/3) n ) trials. 

296. [HM20] Complete the proof of Theorem U by (approximately) maximizing the 
quantity f(p^q) in ( 129 ). Hint: Consider f(p + 1, q)/f{p^ q)^ 

► 297. [HM26] (Emo Welzl.) Let G q (z) = C p ^ p -\. q -i(z/3) p+q (2z/3) p be the generat¬ 

ing function for stopping time t = 2p + q when Yo = ^ in the proof of Theorem U. 

a) Find a closed form for G q [z), using formulas from Section 7.2*1. 6 . 

b) Explain why G q (l) is less than 1. 

c) Evaluate and interpret the quantity G r q {l)/G q (l). 

d) Use Markov’s inequality to bound the probability that = 0 for some t < N. 

e) Show that Corollary W follows from this analysis- 

298. [HM22] Generalize Theorem U and Corollary W to the case where each clause 
has at most k literals，where k >3. 

299 . [HM23] Continuing the previous exercise, investigate the case k = 2. 

► 300• [25] Modify Algorithm P so that it can be implemented with bitwise operations, 
thereby running (say) 64 independent trials simultaneously. 

► 301. [25] Discuss implementing the algorithm of exercise 300 efficiently on MMIX. 
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302. [26] Expand the text’s high-level description of steps W4 and W5, by providing 
low-level details about exactly what the computer should do. 

303. [HM20] Solve exercise 294 with Algorithm W in place of Algorithm P. 

304. [HM34] Consider the 2 SAT problem with n(n — 1) clauses (xj W Xk) for all j ^ k. 
Find the generating functions for the number of flips taken by Algorithms P and W. 
Hint: Exercises 1.2.6—68 and MPR-105 are helpful for finding the exact formulas. 

► 305. [M25] Add one more clause, (xi V x‘ 2 ), to the previous exercise and find the 
resulting generating functions when n = 4. What happens when p = 0 in Algorithm W? 

► 306. [HM32] (Luby ， Sinclair，and Zuckerman, 1993.) Consider a “Las Vegas algo¬ 
rithm^ that succeeds or fails; it succeeds at step t with probability pt, and fails with 
probability poo < 1- Let qt = pi + P 2 + • - + pt and Et = pi + 2p2 + • • • + tpt] also let 
Eoo = oo if poo > 0, otherwise Eoo = ^2 t tpt- (The latter sum might be oo.) 

a) Suppose we abort the algorithm and restart it again，whenever the first N steps 
have not succeeded- Show that if qN > 0, this strategy will succeed after per¬ 
forming an average of l(N) < oo steps- What is l(N)7 

b) Compute l(N) when p m = —^ = n ~ m ， otherwise pt = 0, where 1 < m < n. 

c) Given the uniform distribution, ^ for 1 < t < n, what is l(N)? 

d) Find all probability distributions such that l(N) = 1(1) for all N >1. 

e) Find all probability distributions such that l(N) = l{n) for all N > n. 

f) Find all probability distributions such that q n +i = 1 and l{n) < l{n + 1). 

g) Find all probability distributions such that 奶 =1 and 1(1) < 1(3) < 1(2). 

h) Let l = infiv>i l(N), and let be the least positive integer such that l(N#) = l, 

or oo if no such integer exists* Prove that 1ST = oo implies l = < oo. 

i) Find iV*for the probability distribution p t = [t>n]/ ((t — n)(t + l —n )) 5 given n> 0. 

j) Exhibit a simple example of a probability distribution for which = oo. 

k) Let L = min^>i t/qt. Prove that l < L < 21 — 1. 

307. [HM28] Continuing exercise 306, consider a more general strategy defined by an 
infinite sequence of positive integers (TVi, %,•••): “Set j 卜 0; then, while success has 
not yet been achieved, set j ^ j +1 and run the algorithm with cutoff parameter Nj?’ 

a) Explain how to compute EX, where X is the number of steps taken before this 
strategy succeeds. 

b) Let T )= 队 … Nj. Prove that EX = < T j)K N j)^ if we 

have qN, >0 for all j. 

j 

c) Consequently the steady strategy (N\ N' …) is best: EX > l(N^) = 1. 

d) Given n，exercise 306(b) defines n simple probability distributions p( m ) that have 

Z(iV*) = n, but the value of = m is different in each case. Prove that any 
sequence (Ni, N 2 , …） must have EX > \nH n — = \lHi — on at least one 

of those p^ m \ Hint: Consider the smallest r such that, for each m, the probability 
is > I that r trial runs suffice; show that > n/ ( 2 m) of {iVi , …， N r } are > m. 

308. [M29] This exercise explores the “reluctant doubling” sequence ( 130 ). 

a) What is the smallest n such that S n = 2 a ， given a > 0? 

b) Show that {n \ S n = 1} = {2k + 1 — | A: > 0}; hence the generating function 

^2 n z n [S n = 1 ] is the infinite product z(l + 之 ）（1 + z 3 )(l + z 7 )(l H- z 10 ).... 

c) Find similar expressions for {n | 5 n = 2 a } and ^2 n z n [S n = 2 a ]. 

d) Let S(a, 6 , k) = Sn, where is the 2 b kth occurrence of 2 a in (5 n ). 

For example, S(l ， 0, 3) =& + ••• + Sio = 16. Evaluate E(a ， 6 , 1) in closed form. 

e) Show that E(a ， 6 , k+1) — E(a ， b^k) < (a + 6 + 2A: — l)2 a+6 , for all k >1. 
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f) Given any probability distribution as in exercise 306 (k)，let a = \\gt~\ and b = 
[lg 1/^1, where t/qt = L\ thus L < 2 a+b < AL. Prove that if the strategy of 
exercise 307 is used with Nj = Sj, we have 

EX < E(a, 6,1) -f- Q k (E(a,b, k+1) — S(a, 6, k)), where Q = (1 — q 2 a ) 2b - 

k>l 

g) Therefore (S n ) gives EX < 131 lg l + 49/, for every probability distribution. 

309 • [20] Exercise 293 explains how to use the reluctant doubling sequence with 
Algorithm C. Is Algorithm C a Las Vegas algorithm? 

310 . [M25] Explain how to compute the “reluctant Fibonacci sequence” 


1,1, 2,1, 2, 3,1,1, 2, 3, 5,1,1, 2,1, 2, 3, 5, 8,1,1, 2,1, 2, 3,1,1, 2, 3, 5, 8,13,1, 


5 


which is somewhat like ( 130 ) and useful as in exercise 308, but its elements are Fibonacci 
numbers instead of powers of 2. 

Sll. [21] Compute approximate values of EX for the 100 probability distributions of 
exercise 306(b) when n = l = 100, using the method of exercise 307 with the sequences 
(S n ) of exercise 308 and (S f n ) of exercise 310. Also consider the more easily generated 
“ruler doubling” sequence (R n )^ where R n = n h —n = 2 pn • Which sequence is best? 

312. [HM24] Let T(m ， n) = EX when the reluctant doubling method is applied to 
the probability distribution defined in exercise 306(b)- Express T(m ， n) in terms of the 
generating functions in exercise 308(c). 

► 313. [22] Algorithm W always flips a cost-free literal if one is present in Cj, without 
considering its parameter p. Show that such a flip always decreases the number of 
unsatisfied clauses, r; but it might increase the distance from x to the nearest solution. 

► 314. [36] (H. H. Hoos ， 1998-) If the given clauses are satisfiable，and if p > 0， can 
there be an initial x for which Algorithm W always loops forever? 

315. [Ml8] What value of p is appropriate in Theorem J when d = 11 

316. [HM20] Is Theorem J a consequence of Theorem L? 

► 317. [M26] Let a(G) = Pr(Ai fl … H A m ) under the assumptions of ( 133 )，when pi = 
p = {d — l、 d - 1 /d d for 1 < i < m and every vertex of G has degree at most d > 1. Prove ， 
by induction on m，that a{G) > 0 and that a{G) > ^-a(G\v) when v has degree < d. 

318. [HM27] (J. B. Shearer.) Prove that Theorem J is the best possible result of its 

kind: If p > (d — l) d ~ l /d d and d > 1, there is a graph G of maximum degree d for 
which (p ， • • • ， p) 多 Hint: Consider complete t-ary trees，where t = d — 1. 

319. [HM20] Show that pde < 1 implies p < (d — l) d ~ l /d d . 

320. [M24] Given a lopsidependency graph the occurrence threshold p(G) is the 
smallest value p such that it’s sometimes impossible to avoid all events when each 
event occurs with probability p. For example, the Mobius polynomial for the path P 3 is 
1—pi —p 2 —P 3 +P 1 P 3 ； so the occurrence threshold is 0_ 2 , the least p with 1 —3p+p 2 < 0- 

a) Prove that the occurrence threshold for P m is 1/(4 cos 2 ^^). 

b) What is the occurrence threshold for the cycle graph C m ? 

321. [M24] Suppose each of four random events A^B^C^D occurs with probability p ， 
where {A^ C} and {B^ D} are independent- According to exercise 320(b) with m = 4, 
there’s a joint distribution of (A, D) such that at least one of the events always 
occurs, whenever p > (2 — V2)/2 ^ 0-293 - Exhibit such a distribution when p = 3/10- 
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► 322. [HM35] (K. Kolipaka and M. Szegedy ， 2011.) Surprisingly 5 the previous exercise 
cannot be solved in the setting of Algorithm M! Suppose we have independent random 
variables {W^X^ Z) such that A depends on W and X， B depends on X and 
C depends on Y and D depends on Z and W. Here W equals j with probability Wj 
for all integers j; X, Y, and Z are similar. This exercise will prove that the constraint 
An B D C C\ D is always satisfiable, even when p is as large as 0.333. 

a) Express the probability Pr(A fl J5 fl (7 fl D) in a convenient way. 

b) Suppose there’s a distribution of W, X , Y, Z with Pr(A) = Pr(B) = Pr(C)= 
Pr(D) = p and Pr(A fl B fl (7 fl D) = 0. Show that there are ten values such that 


0 g a ， 6 , c ， d,a f ,b f ，c’，d’ H 
辦 + (1 — /j)a f < 

"c + (1 — v)c < 
a + d>l or 6 + c > 1 , 
a 7 + (i > 1 or + c > 1 , 


0 < "，p < 1 ， 

// 6 + (1 - ii)b' < p, 
vd+ {1 — v) d! < p, 
a + d! > 1 or 6 + c 7 > 1 , 
a’ + > 1 or 6 ’ + c’ > 1 . 


c) Find all solutions to those constraints when p = 1/3. 

d) Convert those solutions to distributions that have Pr(A fl B fl (7 fl D) = 0. 

323 • [10] What trace precedes ccb in the list ( 135 )? 

► 324. [22] Given a trace a = for a graph explain how to find all strings /3 

that are equivalent to a ， using Algorithm 7.2.1.2V. How many strings yield ( 136 )? 

► 325. [20] An acyclic orientation of a graph G is an assignment of directions to each of 
its edges so that the resulting digraph has no oriented cycles. Show that the number of 
traces for G that are permutations of the vertices (with each vertex appearing exactly 
once in the trace) is the number of acyclic orientations of G. 

326. [20] True or false: If a and /3 are traces with a = p, then a R = /3 R . (See ( 137 ).) 

► 327. [22] Design an algorithm to multiply two traces a and ^ when clashing is defined 
by territory sets T(a) in some universe U. Assume that U is small (say \U\ < 64 )， so 
that bitwise operations can be used to represent the territories* 


328. [20] Continuing exercise 327， design an algorithm that computes a/p. More 
precisely，if ^ is a right factor of a ， in the sense that a = 7 /? for some trace 7 , your 
algorithm should compute 7 ; otherwise it should report that p is not a right factor. 

329. [21] Similarly, design an algorithm that either computes a\ /3 or reports that 
a isn’t a left factor of (3. 

► 330. [21] Given any graph explain how to define territory sets T(a) for its vertices a 
in such a way that we have a = b or a —— b if and only if T(a) flT(fe) ^ 0. (Thus traces 
can always be modeled by empilements of pieces.) Under what circumstances is it 
possible to do this with |T(a)| = 2 for all a, as in the text’s example ( 136 )? 

331. [M20] What happens if the right-hand side of ( 139 ) is expanded without allowing 
any of the variables to commute with each other? 


332. [20] When a trace is represented by its lexicographically smallest string，no letter 
in that representative string is followed by a smaller letter with which it commutes. 
(For example, no c is followed by a in ( 135 )，because we could get an equivalent smaller 
string by changing ca to ac.) 

Conversely, given any ordered set of letters, some of which commute, consider all 
strings having no letter followed by a smaller letter with which it commutes. Is every 
such string the lexicographically smallest of its trace? 
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► 333. [M20] (Carlitz ， Scoville, and Vaughan ， 1976.) Let D be a digraph on {1，• • • ， m }， 
and let A be the set of all strings aj 1 ... aj n such that ji — > ji+i in D for 1 < i < n. 
Similarly let B be the set of all strings aj 1 ... aj n such that ji for 1 < i < n. 

Prove that 

= 1 / = Yli 1 - Zl( _1 ) l/31 ^) 

aeA / 3 eB /c >0 \ / 3 eB 

is an identity in the noncommutative variables {ai, • • • ， a m }. (For example, we have 
l+a + 6 + a6 + 6a + aba + bab + … =(a + 6 — aa — 66 + aaa + bbb —…) k 

k >0 


in the case m = 2 , 1 — > 2 ^ 2 — y 1 , 27 ^ 2 .) 

► 334. [25] Design an algorithm to generate all traces of length n that correspond to a 
given graph on the alphabet { 1 , • • •, m}，representing each trace by its lexicographically 
smallest string. 


335. [HM26] If the vertices of G can be ordered in such a way that x < y < z and 
x-j—y and y—j—z implies x—j—z, show that the Mobius series Mg can be expressed as 


a determinant- For example, 


if 


a 

G = c 
e 



then 


M g 
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► 336. [M20] If graphs G and H on distinct vertices have the Mobius series Mg and Mh , 
what are the Mobius series for (a) G ® H and (b) G —— HI 

337. [M20] Suppose we obtain the graph G r from G by substituting a clique of vertices 
{ai ， …， a ； c} for some vertex a, then including edges from aj to each neighbor of a for 
I < j < k. Describe the relation between M G ，and Mg. 

338. [M21] Prove Viennof s general identity ( 144 ) for source-constrained traces. 

► 339. [HM26] (G. Viennot-) This exercise explores factorization of traces into pyramids. 

a) Each letter Xj of a given trace a = xi ... x n lies at the top of a unique pyramid /3j 
such that pj is a left factor of a. For example, in the trace bcebafdc of ( 136 ), the 
pyramids /3i , …， /?8 are respectively 6 , be, e ， beb, beba, e/ ， bced, and beebde. 
Explain intuitively how to find these pyramidal left factors from a^s empilement. 

b) A labeled trace is an assignment of distinct numbers to the letters of a trace; for 

example, abca might become a^bTCQa^. A labeled pyramid is the special case when 
the pyramid’s top element is required to have the smallest label. Prove that every 
labeled trace is uniquely factorizable into labeled pyramids whose topmost labels 
are in ascending order. (For example, - - /s-) 

c) Suppose there are t n traces of length n ， and p n pyramids. Then there are T n = 
n! t n labeled traces and P n = (n — l)!p n labeled pyramids (because only the 
relative order of the labels is significant). Letting T\z) = E n > 0 T n 2 ： n /n! and 
P{z) = ^ n>1 P n z u /n\^ prove that the number of labeled traces of length n whose 
factorization in part (b) has exactly l pyramids is n! [z n ] P{z) l /l\. 

d) Consequently T(z) = e p( ^ z \ 

e) Therefore (and this is the punch line!) In Mg ( 2 ：) = — ^2 n>1 p n z n /n. 
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► 340. [M20] If we assign a weight w(a) to every cyclic permutation a^ then every per¬ 
mutation 7r has a weight w(7r) that is the product of the weights of its cycles* For 
example, if 7r = ( 3142765 ) ~ (1 342)(57)(6) then w(tt) = w;((134 2))w;((5 7)) , u;((6)). 

The permutation polynomial of a set 5 is the sum of w(tt) over all permutations 
of S. Given any n x n matrix A = (aij ), show that if s possible to define appropriate 
cycle weights so that the permutation polynomial of {1, … ， n} is the determinant of A. 

341 . [M25] The involution polynomial of a set S is the special case of the permuta¬ 
tion polynomial when the cycle weights have the form WjjX for the 1-cycle (j) and 
—Wij for the 2-cycle (i j), otherwise w(a) = 0- For example，the involution polyno¬ 
mial of {1 ， 2, 3, 4} is W11W22W33W44X 4 — W11W22W34X 2 — W11W23W44X 2 — W11W24W33X 2 — 

W 12 W 33 W 44 X 2 — W 13 W 22 W 44 X 2 — W 14 W 22 W 33 X 2 + W 12 W 34 + 1 ^ 13^24 + ^ 14 ^ 23 - 


Prove that, if Wij > 0 for 1 < i < j < n, the involution polynomial of {1, ••” n} 
has n distinct real roots- Hint: Show also that, if the roots for {1，n — 1} are 
qi < • • • < q n -i^ then the roots r/c for { 1 , • • •, n} satisfy ri < qi < r 2 < - < q n - 1 < r n . 

342. [HM25] (Cartier and Foata ， 1969.) Let G n be the graph whose vertices are the 
(Jl) (k — 1 )! cyclic permutations of subsets of { 1 ” •” n } 5 with a ——丁 when a and 
r intersect. For example, the vertices of G 3 are (1) ， (2), (3) ， (12) ， (13) ， (23) ， (123 )， 
(132); and they’re mutually adjacent except that (1) —j— (2), (1) —j— (3), (1) —j— (23), 
(2) —j— (3), (2) —j— (13), (12) -j— (3). Find a beautiful relation between Mc n and the 
characteristic polynomial of an n x n matrix. 

► 343. [M25] If G is any cograph, show that (pi，• • • ,Pm) G 1Z{G) if and only if we have 
Mg 〈 Pi ， … ^Pm) > 0. Exhibit a non-cograph for which the latter statement is not true. 

344. [MSS] Given a graph G as in Theorem S, let … ， B m have the joint probabil¬ 
ity distribution of exercise MPR—31, with 717 = 0 whenever / contains distinct vertices 
{i^j} with i —— j, otherwise 717 = Y\ ieI Pi- 

a) Show that this distribution is legal (see exercise MPR—32) if (pi ， •… ,p m ) G 7Z(G). 

b) Show that this “extreme distribution” also satisfies condition ( 147 ). 

c) Let /3(G) = Pr(_Bi fl- - If J [ {1 ， … ， m}，express j3(G\J) in terms of Mg. 

d) Defining a(G) as in exercise 317, with events Aj satisfying ( 133 ) and probabilities 
Q?i，• • • ， Pm) G 71(G), show that a(G \ J) > /3(G \ J) for all J g {1，• • • ， m}. 



If pi satisfies ( 134 ), show that p{G\J) - n < 7 Gj( l _ ^)- 


345. [M30] Construct unavoidable events that satisfy ( 147 ) when (j?i ， … ,p m ) ^ 1Z{G). 

► 346. [HM28] Write ( 142 ) as Mg = M G \ a {l — aK a ,G 、where K a ,G = M G \ a * /M G \ a . 

a) If (j?i ， … ， Pm) G prove that K a ^o is monotonic in all of its parameters: It 

does not increase if any of pi, … ， p m are decreased- 

b) Exploit this fact to design an algorithm that computes … ， p m ) and 

decides whether or not (pi ， … ， i?m) G ^(G), given a graph G and probabilities 
(Pi ， … 5 Pm)- Illustrate your algorithm on the graph G = P^U P 2 of exercise 335. 


► 347. [M28] A graph is called chordal when it has no induced cycle Ck for k > 3. 
Equivalently (see Section 7.4.2)，a graph is chordal if and only if its edges can be 
defined by territory sets T(a) that induce connected subgraphs of some tree. For 
example, interval graphs and forests are chordal. 

a) Say that a graph is tree-ordered if its vertices can be arranged as nodes of a forest 
in such a way that 
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a —— b implies a y b or b y a; 
a y b y c and a —— c implies a —— b. 


(*) 
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(Here c a y means that a is a proper ancestor of b in the forest.) Prove that 
every tree-ordered graph is chordal. 

b) Conversely，show that every chordal graph can be tree-ordered. 

c) Show that the algorithm in the previous exercise becomes quite simple when it 
is applied to a tree-ordered graph, if a is eliminated before b whenever a y b. 

d) Consequently Theorem L can be substantially strengthened when G is a chordal 

graph: When G is tree - ordered by the probability vector (pi, … ， p m ) is in 
7Z(G) if and only if there are numbers 0 < ••• < 1 such that 

Pi = Gi n (! - ^i)- 

i — j inG, iyj 

348. [HM26] (A. Pringsheim ， 1894.) Show that any power series f(z) = a nZ n 

with a n > 0 and radius of convergence p, where Q < p < oo, has a singularity at 2 ： = p. 

► 349. [M24] Analyze Algorithm M exactly in the two examples considered in the text 

(see ( 150 )): For each binary vector x = xi ... X 7 ^ compute the generating function 
9 x{z) = where p x ,t is the probability that step M3 will be executed exactly 

t times after step Ml produces x. Assume that step M2 always chooses the smallest 
possible value of j. (Thus the c Case T scenario in ( 150 ) will never occur.) 

What are the mean and variance of the running times, in (i) Case 1? (ii) Case 2? 

► 350. [HM26] (W- Pegden.) Suppose Algorithm M is applied to the m = n + 1 events 

Aj = Xj for 1 < j < n; A m = xi V • • • V x n . 

Thus A m is true whenever any of the other Aj is true; so we could implement step M2 
by never setting j ^ m. Alternatively，we could decide to set j ^ m whenever possible. 
Let (iVi, iVn, iViii ， iVi” N v ) be the number of resamplings performed when parameter ^ 
of the algorithm is (i) 1 / 2 ; (ii) l/( 2 n); (iii) l/ 2 n ; (iv) l/(n + &); (v) l/(n + k) 2 . 

a) Find the asymptotic mean and variance of each TV, if j is never equal to m. 

b) Find the asymptotic mean and variance of each if j is never less than m. 

c) Let G be the graph on {1， …， n + 1} with edges j —— （n + 1) for 1 < j < n, and 
let pj = Pr(Aj). For which of the five choices of ^ is Q?i ， … ， p n +i) G 71(G)? 

► 351. [25] The Local Lemma can be applied to the satisfiability problem for m clauses 
on n variables if we let Aj be the event Ci Cj is not satisfied •” The dependency graph G 
then has i —— j whenever two clauses Ci and Cj share at least one common variable. 
If, say, Ci is (X 3 V ^5 V xq)^ then ( 133 ) holds whenever pi > (1 — $ 3 )$ 5(1 — ⑸, assuming 
that each x/c is true with probability independent of the other x’s. 

But if ， say ，Cj is ( 无 2 V X 3 V X 7 ), condition ( 133 ) remains true even if we don’t 
stipulate that i —— j. Variable X 3 appears in both clauses, yet the cases when Cj is 
satisfied are never bad news for Ci. We need to require that i —— j in condition ( 133 ) 
only when Ci and Cj are “resolvable” clauses, namely when some variable occurs 
positively in one and negatively in the other. 

Extend this reasoning to the general setting of Algorithm M, where we have 
arbitrary events Aj that depend on variables Ej : Define a lopsidependency graph G for 
which ( 133 ) holds even though we might have i—j —3 in some cases when 三 《 fl 三 j — 0 . 

352. [M21] Show that Ej < Oj /(I — Oj) in ( 152 ), when ( 134 ) holds. 

353. [M21] Consider Case 1 and Case 2 of Algorithm M as illustrated in ( 150 ). 

a) How many solutions xi ... x n are possible? (Generalize from n = 7 to any n.) 

b) How many solutions are predicted by Theorem S? 
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c) Show that in Case 2 the lopsidependency graph is much smaller than the depend¬ 
ency graph. How many solutions are predicted when the smaller graph is used? 

354. [HM20] Show that the expected number E iV of resampling steps in Algorithm M 
is at most (1)/M5(1). 

355. [HM21 ] In (152 )， prove that Ej < 1 /S when (pi ， …， Pm) has positive slack S. 
Hint: Consider replacing pj by pj + Spj. 

► 356. [MSS] (The Clique Local Lemma.) Let G be a graph on and let 

G \ Ui^ ... ^ G \ Ut be cliques that cover all the edges of G. Assign numbers Oij > 0 to 
the vertices of each Uj, such that Ej = ^2 ieU . Oij < 1. Assume that 

Pr(Ai) = pi < Oij (1 + Oik — ^k) whenever 1 < i < m and i G t/j. 

ieUk 


a) Prove that (j?i， •… € 1Z(G). Hint: Letting As denote f] ieS ^i，show that 


Pr(Ai \ As) < 6ij whenever 1 <i < m and i G Uj and S DUj = 

b) Also Ei in ( 152 ) is at most mini — j inG Oij /(I — Ej). (See Theorems M and K.) 

c) Improve Theorem L by showing that，if 0< ^ < 1 5 then (pi ， … ， Pm) ^1Z(G) when 


Pi 



► 357. [M20] Let x = and y = tt v in ( 155 )，and suppose the field of variable v is 
(p ， g ， r). Express x and y as functions of p, and r. 

358. [M20] Continuing exercise 357, prove that r = max(p, r) if and only if 1 / > 

359. [20] Equations ( 156 ) and ( 157 ) should actually have been written 


h-^c = 


(1 - 7r r ) (1 - ⑺) Uiec^c^ 1 - 
7T r + (1 -7T r )(l - V^Uiec^ci 1 -VC-^l) 


and 7 ]c-^i = H me, 

C3V^l 


to avoid division by zero. Suggest an efficient way to implement these calculations- 

360. [M23] Find all fixed points of the seven-clause system illustrated in ( 159 ), given 
that 7 Ti = 7 r *2 = 714 = 1 - Assume also that rhijj = 0 for all 1. 

► 361. [M22] Describe all fixed points rj c _^ l = rj f c _^ l of the equations ( 154 ) ，（ 156 ) ， ( 157 ), 
for which each rjc-^i and each r/i is either 0 or 1 . 

362. [20] Spell out the computations needed to finish Algorithm S in step S 8 . 

► 363. [M30] (Lattices of partial assignments.) A partial assignment to the variables 
of a satisfiability problem is called stable (or “valid”）if it is consistent and cannot be 
extended by unit propagation. In other words, it’s stable if and only if no clause is 
entirely false，or entirely false except for at most one unassigned literal- Variable Xk of 
a partial assignment is called constrained if it appears in a clause where 士 x/c is true 
but all the other literals are false (thus its value has a “reason”）. 

The 3 n partial assignments of an n-variable problem can be represented either as 
strings x = xi ... x n on the alphabet {0, 1， *} or as sets L of strictly distinct literals. For 
example，the string x = *1*01* corresponds to the set L = {2, 4, 5}. We write x -< x 
if x is equal to x except that x k = * and x k G {0,1}; equivalently L X L f if L f = L U k 
or L f = LUk. Also x O x' if there are t > 0 stable partial assignments x。) with 

( 0 ) ^ ( 1 ) ^ ^ ⑴ / 

X = x K J -< x K J ^ • • • -<( X K = X . 
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d) 

e) 

f) 

g) 


Let pi^ ^ p n ^ qi^ ^ q n be probabilities, with p k -\-q k = 1 for 1 < A: < n- Define 
the weight W (x) of a partial assignment to be 0 if x is unstable, otherwise 

W(x) = PJ{p/c I x/c = *} • I x/c * and Xk is unconstrained}. 

[E. Maneva, E. Mossel, and M. J. Wainwright, in JACM 54 (2007), 17:1-17:41, studied 
general message-passing algorithms on partial assignments that are distributed with 
probability proportional to their weights, in the case pi = ••• = p n = p, showing that 
survey propagation (Algorithm S) corresponds to the limit as p —> 1-] 

a) True or false: The partial assignment specified by the literals currently on the 
trail in step C5 of Algorithm C is stable. 

b) What weights W(x) correspond to the clauses F in ( 1 )? 

c) Let a; be a stable partial assignment with Xk = 1, and let x and x" be obtained 

from x by setting 0, xl ，*. True or false: x/c is unconstrained in x if and 

only if (i) x is consistent; (ii) x is stable; (iii) x n is stable. 

If the only clause is 123 = (xi V X 2 V X 3 )，find all sets L such that L □ {1, 2, 3}. 
What are the weights when there’s only a single clause 123 = (xi V X 2 V 
Find clauses such that the sets L with L □ {1 ， 2,3,4, 5} are 0 ， {4}, {5}, {1,4}, 
{2,5}, {4,5}, {1,4,5}, {2,4,5}, {3,4,5}, {1,3,4, 5}, {2, 3,4,5}, {1,2,3,4, 5}. 

Let £ be a family of sets C {1 ， . … ， n}, closed under intersection, with the property 
that L £ C implies L = L( 0 ) X l/ 1 ) X ^ L(’）= {1,, n} for some L ⑴ £ C. 
(The sets in (d) form one such family，with n = 5.) Construct strict Horn clauses 
with the property that L G £ if and only if L □ {1，• • • ， n}. 

h) True or false: If L, I/ ， L" are stable and 1/ 弋 L ， L 〃 L, then 1/ fl L 〃 is stable. 

i) If 1/ O L and L n □ L, prove that 1/ fl I/" E L. 

j) Prove that W{x) = Yl{pk \ Xk = *} whenever x is stable. 

► 364. [M21] A covering assignment is a stable partial assignment in which every as¬ 
signed variable is constrained. A core assignment is a covering assignment L that 
satisfies L □ L 7 for some total assignment L r . 

a) True or false: The empty partial assignment L = 0 is always covering. 

b) Find all the covering and core assignments of the clauses F in (i). 

c) Find all the covering and core assignments of the clauses R f in ( 7 ). 

d) Show that every satisfying assignment L r has a unique core- 

e) The satisfying assignments form a graph, if two of them are adjacent when they 
differ by complementing just one literal- The connected components of this graph 
are called clusters. Prove that the elements of each cluster have the same core. 

f) If Ll and L" have the same core, do they belong to the same cluster? 

365. [M27] Prove that the clauses waerden(3^ 3; n) have a nontrivial (i.e. ， nonempty) 
covering assignment for all sufficiently large n (although they’re unsatisfiable). 

► 366. [18] Preprocess the clauses R f of ( 7 ) - What erp rules are generated? 

► 367. [20] Justify the erp rule ( 161 ) for elimination by resolution. 

368. [16] Show that subsumption and downhill resolution imply unit conditioning: 
Any preprocessor that does transformations 2 and 4 will also do transformation 1. 

► 369. [21 ] (N. Een and A. Biere.) Suppose l appears only in clauses Ch, … ， C p and l 
appears only in clauses C、, …， C、, where we have C\ — (/V/i V- • •Mir) and Cj = 

for 1 < j < r- Prove that we can eliminate |/| by using the erp rule l ^ (/1 V • • • V l r ) 
and replacing those p ^ q clauses by only (p — 2 )r + q others，namely 

{Cl oC'j\r <j <q] U {C { o Cj \ 1 < i < p,l < j < r}. 
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(The case r = 1 is especially important. In many applications — for example in the 
examples of fault testing ， tomography，and the “Life in 4” problem about extending 
Fig. 35 — more than half of all variable eliminations admit this simplification.) 

370. [20] The clauses obtained by resolution might be needlessly complex even when 
exercise 369 doesn’t apply. For example, suppose that variable x appears only in the 
clauses (x W a) A (x W aW c) A (x W b) A (x W b\/ c). Resolution replaces those four clauses 
by three others: (a V 6 ) A (a V 6 V c) A (a V 6 V c). Show ， however, that only two clauses ， 
both binary, would actually suffice in this particular case. 

371. [24] By preprocessing repeatedly with transform at ions 1—4, and using exercise 
369, prove that the 32 clauses ( 9 ) of waerden(3^ 3; 9) are unsatisfiable. 

372. [25] Find a “small” set of clauses that cannot by solved entirely via transforma¬ 
tions 1-4 and the use of exercise 369. 

373. [25] The answer to exercise 228 defines 2m + — l ) 2 ^ m 3 /3 clauses 

in m 2 variables that suffice to refute the anti-maximal-element axioms of ( 99 )-( 101 )- 
Algorithm L needs exponential time to handle these clauses, according to Theorem R; 
and experiments show that they are bad news for Algorithm C too. Show, however, 
that preprocessing with transformations 1-4 will rapidly prove them unsatisfiable. 

► 374. [32] Design data structures for the efficient representation of clauses within a SAT 
preprocessor. Also design algorithms that (a) resolve clauses C and C f with respect to 
a variable x\ (b) find all clauses C r that are subsumed by a given clause C\ (c) find all 
clauses C f that are self-subsumed by a given clause C and a literal l G C. 

375. [21] Given |Z|，how can one test efficiently whether or not the special situation in 
exercise 369 applies, using (and slightly extending) the data structures of exercise 374? 

► 376. [32] After a preprocessor has found a transformation that reduces the current set 
of clauses, it is supposed to try again and look for further simplifications- (See ( 160 ).) 
Suggest methods that will avoid unnecessary repetition of previous work, by using (and 
slightly extending) the data structures of exercise 374. 

377. [22] (V. Vassilevska Williams.) If (7 is a graph with n vertices and m edges ， 
construct a 2SAT problem F with 3n variables and 6 m clauses，such that G contains a 
triangle (a 3-clique) if and only if F has a failed literal- 

378. [20] (Blocked clause elimination.) Clause (7 = (/ V /1 V • • • V is said to be 
blocked by the literal l if every clause that contains l also contains either 『1 or ••• or 
Exercise 161(b) proves that clause C can be removed without making an unsatisfiable 
problem satisfiable. Show that this transformation requires an erp rule, even though it 
doesn’t eliminate any of the variables. What erp rule works? 

► 379. [20] (Blocked s elf - subsumption •) Consider the clause (a V 6 Vc Vd)，and suppose 
that every clause containing a but not b nor c also contains d. Show that we can then 
shorten the clause to (6 V c V d) without affecting satisfiability. Is an erp rule needed? 

380. [21] Sometimes we can use self-subsumption backwards, for example by weaken¬ 
ing the clause (/ 1 V /2 V/ 3 ) to (/ 1 V- • -V4) if each intermediate replacement of (/ 1 V- • -V/j) 
by (Zi V. • .Wj-i) is justifiable for 3 < j < k. Then，if we’re lucky, the clause (Zi V. - -V/^) 
is weak enough to be eliminated; in such cases we are allowed to eliminate (h V 12 V I 3 ) - 

a) Show that (a V 6 V c) can be eliminated if it is accompanied by the additional 
clauses (a V 6 V J)，（a V d V e )， (b\/ dW e). 

b) Show that (a V 6 V c) can also be eliminated when accompanied by (a V 6 V J )， 
(a V c V J), (6 V d V e), (6 V c V e), provided that no other clauses contain c. 
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c) What erp rules, if any, are needed for those eliminations? 

381 . [22] Combining exercises 379 and 380, show that any one of the clauses in 

(xi V X 2 ) A (X 2 V x 3 ) A • • • A {xn -1 v x n ) A (x n V Xi) 

can be removed if there are no other clauses with negative literals. State the erp rules. 

382 . [30] Although the techniques in the preceding exercises are computationally 
difficult to apply, show that a lookahead forest based on the dependency digraph can 
be used to discover some of those simplifications efficiently- 

► 383 . [23] (Inprocessing.) A SAT solver can partition its database of current clauses 
into two parts, the “hard” clauses $ and the “soft” clauses Initially ^ is empty，while 
$ is F, the set of all input clauses. Four kinds of changes are subsequently allowed: 

• Learning. We can append a new soft clause provided that $ U ^ U (7 is 
satisfiable whenever $ U 屯 is satisfiable. 

• Forgetting. We can discard (purge) any soft clause. 

• Hardening. We can reclassify any soft clause and call it hard- 

• Softening. We can reclassify any hard clause C and call it soft, provided that 
$ is satisfiable whenever $ \ (7 is satisfiable. In this case we also should output any 
necessary erp rules, which change the settings of variables in such a way that any 
solution to $ \ (7 becomes a solution to $• 

a) Prove that, throughout any such procedure, F is satisfiable $ is satisfiable 

$ U 屮 is satisfiable. 

b) Furthermore, given any solution to $， we obtain a solution to F by applying the 
erp rules in reverse order. 

c) What is wrong with the following scenario? Start with one hard clause, (x)，and 
no soft clauses. Reclassify (x) as soft, using the erp rule x ^ 1. Then append a 
new soft clause (x). 

d) If C is certifiable for $ (see exercise 385)，can we safely learn Cl 

e) If C is certifiable for $ \ ( 7 , can we safely forget Cl 

f) In what cases is it legitimate to discard a clause, hard or soft, that is subsumed 
by another clause, hard or soft? 

g) In what cases is self-subsumption permissible? 

h) Explain how to eliminate all clauses that involve a particular variable x. 

i) Show that, if 2 ; is a new variable, we can safely learn the three new soft clauses 
(x V z\ {y V z\ [xM z) in Tseytin’s concept of extended resolution. 

384 . [25] Continuing the previous exercise, show that we can always safely forget any 
clause C that contains a literal l for which C o C r is certifiable for $ \ (7 whenever 
C 7 G $ contains l. What erp rule is appropriate? 

385. [22] Clause C is called certifiable for a set of clauses F if F A (7 hi as in ( 119 ). 
It is said to be absorbed by F if it is nonempty and F A C \ l hi l for every / G (7, or if 
it is empty and F hi e. (Every clause of F is obviously absorbed by F.) 

a) True or false: If C is absorbed by F, it is certifiable for F. 

b) Which of {1,12,123} are implied by, certifiable for, or absorbed by R r in (7)? 

c) If C is certifiable for F and if all clauses of F are absorbed by F,, prove that C 
is certifiable for F r . 

d) If C is absorbed by F and if all clauses of F are absorbed by F 、 prove that C is 
absorbed by F f • 
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► 386. [M25] Let Algorithm Co be a variant of Algorithm C that (i) makes all decisions 
at random; (ii) never forgets a learned clause; and (iii) restarts whenever a new clause 
has been learned. (Thus, step C5 ignores M p and Mf ； step C6 chooses l uniformly at 
random from among the 2(n—F) currently unassigned literals; step C8 backjumps while 
F > h, instead of while F > and after step C9 has stored a new clause, with 

d > 0, it simply sets d <— 0 and returns to C2. The data structures HEAP, OVAL, and ACT 
are no longer used.) We will prove that Algorithm Co is ， nevertheless, quite powerful- 
In the remainder of this exercise, F denotes the set of clauses known by Algo¬ 
rithm Co, both original and learned; in particular, the unit clauses of F will be the 
first literals Lq, Li, on the trail. If C is any clause and if / G ( 7 , we define 


score(F, l )= 



oo ， 

\{i f \fa~c\1 hi r>| 


ii FA C\l hi /; 
otherwise. 


Thus score(F, (7, l) represents the total number of literals on the trail after making all 
the unforced decisions of C\l^ if no conflict arises. We say that Algorithm Co performs 
a “helpful round” for C and l if (i) every decision literal belongs to C\ and (ii) l is 
chosen as a decision literal only if the other elements of C are already in the trail 

a) Let C be certifiable for F, and suppose that score(F, (7,/) < oo for some l G C. 
Prove that if F r denotes F together with a clause learned on a helpful round, 
then score C,l) > score (F, C,V). 

b) Furthermore score(F 7 , C^l) > score (F, (7, l) after an unhelpful round- 

c) Therefore C will be absorbed by the set F f of known clauses after at most \C\n 
helpful rounds have occurred. 

d) If \C\ = A:, show that Pr(helpful round) > (k — l)\/(2n) k > 1 /(4n k ). 

e) Consequently, by exercise 385(c), if there exists a certificate of unsatisfiability 
((7i ， •… ， Ct) for a family of clauses F with n variables 5 Algorithm Co will prove 
F unsatisfiable after learning an average of /x < 4 Y^\=i \Ci\n 1+ ^ Ci ' clauses. (And 
it will q.s- need to learn at most # Inn In Inn clauses, by exercise MPR—102.) 

► 387. [21] Graph G is said to be embedded in graph G r if every vertex v oi G corre¬ 
sponds to a distinct vertex v of G’, where u —— v in G r whenever u —— v in G. Explain 
how to construct clauses that are satisfiable if and only if G can be embedded in G’ • 

388• [20] Show that the problems of deciding whether or not a given graph G (a) con¬ 
tains a A: - clique ， (b) can be A: - colored, or (c) has a Hamiltonian cycle can all be regarded 
as graph embedding problems. 

► 389. [22] In this 4x4 diagram, it’s possible to trace out the phrase 
‘THE u ART u 0F u C0MPUTER u PR0GIlAiy[MI]\[G’ by making only king moves and 
knight moves, except for the final step from N to G. 

Rearrange the letters so that the entire phrase can be traced. 

► 390. [23] Let G be a graph with vertices V, edges \E\ = m, |T^| = n, and s^t ^ V. 

a) Construct 0{kn) clauses that are satisfiable if and only if there’s a path of length 
k or less from s to ^ given k. 

b) Construct O(m) clauses that are satisfiable if and only if there’s at least one path 
from 5 to t. 

c) Construct 0(n 2 ) clauses that are satisfiable if and only if G is connected- 

d) Construct 0(km) clauses that are wnsatisfiable if and only if there’s a path of 
length k or less from s to ^ given k. 
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e) Construct O(m) clauses that are wnsatisfiable if and only if there’s at least one 
path from s to t. 

f) Construct 0{m) clauses that are wnsatisfiable if and only if G is connected. (This 
construction is much better than (c)，in a sparse graph.) 

391 . [M25] The values of two integer variables satisfy 0 < x^y < and they are to 
be represented as /-bit quantities - . - xo, yi — i •••?/(), where l = rig d]. Specify three 
different ways to encode the relation x ^ y: 

a) Let x = (xi-i... xo )2 and y = (yi-i . - - yo)‘ 2 ; and let the encoding enforce the 
conditions (x/ 一 i • • • xo )‘2 < d，and (yi-i - - . yo )2 < as well as ensuring that 
x ^ y by introducing 2 / + 1 additional clauses in l auxiliary variables- 

b) Like (a)，but there are d additional clauses (not 2Z + 1), and no auxiliaries- 

c) All bit patterns xi-i ... xo and yi- \ … yo are valid, but some values might have 
two different patterns- The encoding has d clauses and no auxiliary variables- 

392 • [22] The blank spaces in the following diagrams can be filled with letters in such 
a way that all occurrences of the same letter are rookwise connected: 



a) Demonstrate how to do it- (Puzzle (i) is easy; the others less so.) 

b) Similarly, solve the following puzzles — but use kin gw is e connectedness instead. 
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(viii) 


c) Construct clauses with which a SAT solver can solve general puzzles of this kind: 
Given a graph G and disjoint sets of vertices Ti, T^， …， a solution should ex¬ 
hibit disjoint connected sets of vertices Si, S 2 , ••” Si, with Tj C Sj for I < j < t. 

393. [25] (T. R. Dawson ， 1911.) Show that it’s possible for each white 
piece in the accompanying chess diagram to capture the corresponding 
black piece, via a path that doesn’t intersect any of the other paths- How 
can SAT help to solve this problem? 

394. [25] One way to encode the at-most-one constraint ^ y p ) 

is to introduce l = |"lgp] auxiliary variables together with the following nl + n — 2 l 
clauses, which essentially “broadcast” the value of j when yj becomes true: 



(yj V (-l) bt a t ) for 1 < j < p, 1 < t < g = [lg(2 p-j)J, where 2p - j = (I 61 . - . b q ) 2 . 


For example, the clauses when p = 3 are (沒 1 Vai) 八(豆 1 Vd‘ 2 ) 八 ( 沒 2 Vai) 八 ( 沒 2 Va‘ 2 ) 八 ( 沒 3 Vdi). 

Experiment with this encoding by applying it to Langford’s problem，using it in 
place of ( 13 ) whenever p >7. 

395. [20] What clauses should replace ( 15 ) ，（ 16 )，and ( 17 ) if we want to use the order 
encoding for a graph coloring problem? 


connected 

encode 

rookwise connected 
multicommodity flow 
routing, disjoint 
connection puzzles 
king wise connectedness 
Dawson 
chess diagram 
nonintersecting paths 
at-most-one 
auxiliary variables 
broadcast 

Langford’s problem 
order encoding 


September 23, 2015 










7.2.2.2 


SATISFIABILITY: EXERCISES 


171 


► 396. [23] (Double clique hints.) If x has one of the d values {0, 1，…， d — 1}，we can 

represent it bin ary wise with respect to two different orderings by letting x J = [x>j] 
and x J = [x7T >j] for 1 < j < d, where 7 r is any given permutation. For example, if 
d = A and (0 兀， 1 兀， 2 兀， 3 兀 ) =(2, 3, 0, 1)，the representations x 1 x 2 x 3 :x 1 x 2 x 3 of 0, 1, 2, 
and 3 are respectively 000:110, 100:111, 110:000, and 111:100. This double ordering 
allows us to encode graph coloring problems by including not only the hints ( 162 ) but 
also _ _ 

(vf~ k+1 V ... V V d k ~ k+1 ) A d 1 V ... V H 

whenever the vertices { 仍， … ，外 } form a A:-clique. 

Explain how to construct clauses for this encoding, and experiment with coloring 
the n x n queens graph when ( 0 兀， 1 兀， 2 兀， 3 兀 , 4 兀 , • • •) = ( 0 , d— 1 ， 1 ， d— 2 , 2 , • • •) is the 
inverse of the organ-pipe permutation* 

► 397. [22] (N. Tamura, 2014.) Suppose xo^ xi，• • • ， x p -i are integer variables with the 
range 0 < Xi < represented in order encoding by Boolean variables x\ = [xi >j] 
for 0 < i < p and 1 < j < (i. Show that the all-different constraint, u Xi ^ Xj for 
0 < i < j < p，” can be nicely encoded by introducing auxiliary integer variables yo, 
yi, … ， yd-i with the range 0 < yj < p, represented in order encoding by Boolean 
variables y) = [yj > i] for 1 < i < p and 0 < j < and by devising clauses to enforce 
the condition Xi = j => yj = i. Furthermore, hints analogous to ( 162 ) can be given. 

398. [18] Continuing exercise 397, what’s an appropriate way to enforce the all- 
different constraint when xo, … ， x p -i are represented in the direct encoding? 

► 399. [23] If the variables u and v range over d values {1, •… ， d} ， it’s natural to encode 
them directly as sequences u\ .. .Ud and v\ … where Ui = [u = i] and Vj = [v = j], 
using the at-least-one clauses ( 15 ) and the at-most - one clauses ( 17 ). A binary constraint 
tells us which pairs (i,j) are legal; for example, the graph-coloring constraint says that 
i ★ j when i and j are the colors of adjacent vertices in some graph* 

One way to specify such a constraint is to assert the preclusion clauses {ui V Vj) 
for all illegal pairs (之 j)，as we did for graph coloring in ( 16 ). But there’s also another 
general way: We can assert the support clauses 


d ( 

八 ( 仏 v Vte 



\j{ui I (ij) is legal}) 


instead. Graph coloring with d colors would then be represented by clauses such as 
(^3 V vi V i ；2 V ^4 V • • • V Vd), when u and v are adjacent- 

a) Suppose t of the d 2 pairs (i,j) are legal- How many preclusion clauses are needed? 
How many support clauses? 

b) Prove that the support clauses are always at least as strong as the preclusion 
clauses, in the sense that all consequences of the preclusion clauses under unit 
propagation are also consequences of the support clauses under unit propagation, 
given any partial assignment to the binary variables { 乜 1 ， … ，似，仍， … ，如 }• 

c) Conversely, in the case of the graph-coloring constraint, the preclusion clauses 
are also at least as strong as the support clauses (hence equally strong) • 

d) However, exhibit a binary constraint for which the support clauses are strictly 
stronger than the preclusion clauses. 


400 • [25] Experiment with preclusion clauses versus support clauses by applying them 
to the n queens problem. Use Algorithms L ， C，and W for comparison. 

401 . [16] If x has the unary representation x l x 2 ... x d ~ l ^ what is the unary represen¬ 
tation of (a) y = \x/2~\? (b) 2 ： = [(x + 1)/3 」？ 
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402. [18] If x has the unary representation x l x 2 ... x d ~ l ^ encode the further condition 
that x is (a) even; (b) odd. 

403 • [20] Suppose x, y, z have the order encoding，with 0 < x^y^z < d. What clauses 
enforce (a) min(x 5 y) < z? (b) max(x 5 t/) < z? (c) min(x, y) > z? (d) max(x, y) > zl 

404. [21] Continuing exercise 403， encode the condition \x — y\ > a, for a given 
constant a > 1, using either (a) d clauses of length < 4 and no auxiliary variables; 
or (b) 2d — 0{a) clauses of length < 3, and one auxiliary variable. 

405. [M23] The purpose of this exercise is to encode the constraint ax + by < c, when 
a,h, c are integer constants，assuming that y are order-encoded with range [0 •. c?). 

a) Prove that it suffices to consider cases where a, 6, c > 0. 

b) Exhibit a suitable encoding for the special case 13x — 8y < 7^ d = 8. 

c) Exhibit a suitable encoding for the special case 13x — 8y > 1^ d = 8. 

d) Specify an encoding that works for general a,h, c, d. 

406. [M24] Order-encode (a) xy < a and (b) xy > a, when a is an integer constant. 

407. [M22] If z are order-encoded, with 0<x^y<d and 0<z<2d — l^ the clauses 


2d-2 


k 


A A v y k 

k = l j=max(0,/c + l —d) 


y z k ) 


are satisfiable if and only if x+y < z\ this is the basic idea underlying ( 20 ). Another way 
to encode the same relation is to introduce new order-encoded variables u and and to 
construct clauses for the relations L x /2 」 + |j//2 」 < u and \x/2] + \y/2~\ < recursively 
using methods for numbers less than \d/2] and l_d/2」 + 1- Then we can finish the job 
by letting 2： 1 = v 1 ， z 2d ~ 2 = v d (d even) or u d ~ l (d odd), and appending the clauses 


( 丑 J V z 2 ^) A (t ; J+1 V 2 ： 2j ) A {u^ V v^ +1 V 2 ： 2j+1 ), for 1 < j < d — 2. 

a) Explain why the alternative method is valid. 

b) For what values of d does that method produce fewer clauses? 

c) Consider analogous methods for the relation x + y > z. 

408. [25] (Open shop scheduling.) Consider a system of m machines and n jobs ， 
together with an m x n matrix of nonnegative integer weights W = (wij) that represent 
the amount of uninterrupted time on machine i that is needed by job j. 

The open shop scheduling problem seeks a way to get all the work done in t units 
of time, without assigning two jobs simultaneously to the same machine and without 
having two machines simultaneously assigned to the same job. We want to minimize ^ 
which is called the “makespan” of the schedule. 

( 703 \ 

172 j • A “greedy” algorithm, which 

235 J 

repeatedly fills the lexicographically smallest time slot j) such that Wij > 0 but 
neither machine i nor job j have yet been scheduled at time ^ achieves a makespan 
of 12 with the following schedule: 


Ml 


M2 


M3 





J1 





J3 






J2 




J1 



J3 



J3 






J2 


J1 


a) Is 12 the optimum makespan for this W? 
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b) 

c) 

d) 

e) 


Prove that the greedy algorithm always produces a schedule whose makespan is 
less than (max£i X ^ =1 Wij) + (maxj =1 Y1T=i w ij), unless W is entirely zero. 
Suppose machine i begins to work on job j at time Sij, when Wij > 0. What 
conditions should these starting times satisfy, in order to achieve the makespan t? 
Show that the order encoding of these variables Sij yields SAT clauses that nicely 
represent any open shop scheduling problem. 

Let L^V^」be the matrix obtained by replacing each element Wij of W by Lwo / 众 」 • 
Prove that if the open shop scheduling problem for LW/A:」and t is unsatisfiable ， 
so is the problem for W and kt. 


► 409. [M26] Continuing exercise 408, find the best makespans in the following cases: 

a) m = 3, n = 3r + 1; wij = w 2 ( r +j) = ^ 3 ( 2 r+j) = for 1 < j < r; wi n = W 2 n = 

W 3 n = L( a i H - ha r )/2 」； otherwise Wij = 0- (The positive integers aj are given.) 

b) m = 4， n = r + 2; wij = (r + l)aj and W 2 j = 1 for 1 < j < r; W 2 ( n -i) = W 2 n = 

(r + l)L(«i H -+ a r )/2j; w 3 ( n 一 q = w\ n = W 2 n + otherwise Wij = 0. 


c) 

d) 


1 for 1 < j < n; otherwise Wij = 0 . 


m = n; Wjj = n — 2, Wj n = w n j 二 
m = 2 ; wij = aj and w‘ 2 j = bj for 1 < j < n, where ai + • • -+a n 
and aj + bj < s for 1 < j < n. 


6 iH - \-b n = 5 


410. [24] Exhibit clauses for the constraint 13x —g 7 when x and y are log-encoded 
as 3-bit integers x = (x 2 XiXo )2 and y = (?/ 2 |/i?/o) 2 - (Compare with exercise 405(b).) 


► 411. [25] If x = {x m - - - 3 : 1 ) 2 , y = (?/n • • • ?/ 1 ) 2 ， and 2 ： = (z m+n - - - zi) 2 stand for 
binary numbers, the text explains how to encode the relation xy = z with fewer than 
20mn clauses, using Napier—Dadda multiplication. Explain how to encode the relations 
xy < z and xy > z with fewer than 9mn and limn clauses, respectively- 


412. [40] Experiment with the encoding of somewhat large numbers by using a radix-d 
representation in which each digit has the order encoding. 


413. [M20] How many clauses will remain after the auxiliary variables ai ， … ， a n -i 
of ( 169 ) have been eliminated by resolution? 

► 414. [M22] Generalize ( 169 ) to an encoding of lexicographic order on d - ary vectors, 

Oi • • • x n )d S ("1 • • • yn)d, where each x k = x\ + - h x^ 1 and y k = + - h y d k ~ l 

has the order encoding. What modifications to your construction will encode the strict 
relation xi ... x n < yi ... y n l 

415. [M22] Find all CNF formulas for the function (xi ® 奶 ） V • • • V (x n ® " n ). 

416. [20] Encode the condition c if xi ... x n = yi ••• Un then u\ ... u m = Vi ... v m \ us¬ 
ing 2m+2n+l clauses and n+1 auxiliary variables. Hint: 2n of the clauses are in ( 172 ). 

417. [21] Continuing exercise 42, what is the Tseytin encoding of the ternary mux op¬ 
eration c s t? u: v ’？ Use it to justify the translation of branching programs via ( 174 )- 


418. [23] Use a branching program to construct clauses that are satisfiable if and 
only if (Xij) is an m x n Boolean matrix whose rows satisfy the hidden weighted bit 
function h n and whose columns satisfy the complementary function h m . In other words, 





and = 1^ *^cjj = 


assuming that Xio = xoj = 0 . 


419. [M21] If m，n > 3， find (by hand) all solutions to the problem of exercise 418 
such that (a) ^ Xij = m + 1 (the minimum); (b) ^ Xij = van — n — 1 (the maximum). 

420. [18] Derive ( 175 ) mechanically (that is，“without thinking”）from the Boolean 
chain 5 xi ® X 2 ^ c ^ xi A X 2 ^ t s ㊉ 尤 3 , c ^ s A $ 3 , requiring c = c = 0. 
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421. [18] Derive ( 176 ) mechanically from the branching program I 5 = (1?4: 3), I 4 = 
(2?1: 2), h = (2? 2: 0), I 2 = (3? 1:0), beginning at h- 

422. [11] What does unit propagation deduce when the additional clause (xi) or (x‘ 2 ) 
is appended to (a) F in ( 175 )? (b) G in ( 176 )? 

423. [22] A representation F that satisfies a condition like ( 180 ) but with l replaced 
by e can be called “weakly forcing.” Exercise 422 shows that ( 175 ) and ( 176 ) are weakly 
forcing. Does the BDD of every function define a weakly forcing encoding, via ( 173 )? 

► 424. [20] The dual of the Pi function has the prime clauses {123,134,234,234,12} 
(see 7 - 1 - 1 -( 3 o)). Can any of them be omitted from a forcing representation? 

425. [18] A clause with exactly one positive literal is called a definite Horn clause, 
and Algorithm 7.1.1C computes the “core” of such clauses- If F consists of definite 
Horn clauses, prove that x is in the core if and only if F hi if and only if FA(x) hi e. 

► 426. [M20] Suppose F is a set of clauses that represent /(xi, … , x n ) using auxiliary 

variables {ai, … ， a m } as in ( 170 )，where m > 0. Let G be the clauses that result after 
variable a m has been eliminated as in ( 112 ). 

a) True or false: If F is forcing then G is forcing. 

b) True or false: If F is not forcing then G is not forcing. 

427. [M30] Exhibit a function /(^i，• • • ， x n ) for which every set of forcing clauses that 
uses no auxiliary variables has size Q(3 n /n 2 ), although / can actually be represented 
by a polynomial number of forcing clauses when auxiliary variables are introduced. 
Hint: See exercise 7-1-1-116. 


428. [M27] A generic graph G on vertices {1, … ， n} can be characterized by ⑵ 
Boolean variables X = {xij | 1 < i < j < n} 5 where Xij = [i —— j in G]. Properties 
of G can therefore be regarded as Boolean functions, /(X). 

a) Let f n d(X) = [x(G) < d]; that is, f n d is true if and only if G has a d-coloring. 
Construct clauses F nc i that represent the function f n( i{X) V using auxiliary 
variables Z = {zjk | 1 < j < n, 1 < k < d} that mean “vertex j has color k. v 

b) Let G n d be a forcing representation of the Boolean function F n d[X, y, Z), and 
suppose that G n d has M clauses in N variables. (These N variables should 
include the Q) + 1 + nd variables of F n d, along with an arbitrary number of 
additional auxiliaries.) Explain how to construct a monotone Boolean chain of 
cost 0(MN 2 ) for the function (see exercise 7.1-2-84), given the clauses of G n d. 

Note: Noga Alon and Ravi B. Boppana, Combinatorica 7 (1987), 1—22, proved that 
every monotone chain for this function has length exp Q((n/ log n) 1//3 ) when d + 1 = 
L(n/lgn) 2 / 3 /4」. Hence M and N cannot be of polynomial size. 

429. [22] Prove that Bailleux and Boufkhad’s clauses ( 20 ), ( 21 ) are forcing: If any r 
of the x^s have been set to 1, then unit propagation will force all the others to 0. 


430• [25] Similarly, Sinz’s clauses ( 18 ) and ( 19 ) are forcing. 

431. [20] Construct efficient, forcing clauses for the relation xi + - - - 卜 y n . 

432. [24] Exercise 404 gives clauses for the relation \x — y\ > a. Are they forcing? 

433• [25] Are the lexicographic-constraint clauses in ( 169 ) forcing? 

434. [21] Let L/ be the language defined by the regular expression in other 

words, the binary string xi ... x n is in L\ if and only if it consists of zero or more 0 s 
followed by exactly l Is followed by zero or more 0 s. 

a) Explain why the following clauses are satisfiable if and only if xi .. .x n G L\\ 
(i) (pk^Xk), (pk^Pk-i), and (pk-i^Xk^Pk) for 1 < A: < n，also (p 0 ); (ii) fe Vx/c), 
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(qk V g/c+i)，and {qk+i V x/c V qk) for 1 < A: < n, also (g n +i )； (iii) (fk V Pk-i) A 
Ao<d</( f fc v x k+d) A (f/c V qk+i) for 1 < A: < n +1 - also (n V • • • V r n+ i-/)- 
b) Show that those clauses are forcing when l = 1 but not when l = 2. 

► 435. [28] Given / > 2, construct a set of 0(n log l) clauses that characterize the 
language L/ of exercise 434 and are forcing. 

436. [M32] (Nondeterministic finite-state automata.) A regular language L on the 
alphabet A can be defined in the following well-known way: Let Q be a finite set of 
“states，” and let I C Q and O C Q be designated “input states” and “output states •” 
Also let T C Q x A x Q be a set of “transition rules.” Then the string xi ... x n isin L if 
and only if there’s a sequence of states qo, qi, … ， q n such that qo G (qk-i^Xk^qk) ^ T 
for 1 < A: < n, and q n G O. 

Given such a definition，where A = {0 ， 1}, use auxiliary variables to construct 
clauses that are satisfiable if and only if xi.. .x n G L. The clauses should be forcing, 
and there should be at most 0{n\T\) of them* 

As an example, write out the clauses for the language L 2 = 0* 1 2 CT of exercise 434. 

437. [M21] Extend exercise 436 to the general case where A has more than two letters. 

438. [21] Construct a set of forcing clauses that are satisfiable if and only if a given 

binary string xi . .. x n contains exactly t runs of Is, having lengths h) from 

left to right- (Equivalently，the string xi .. .x n should belong to the language defined 
by the regular expression 0*1“ 0 + 1 匕0 +. • • 0+l“0*.) 

► 439. [30] Find efficient forcing clauses for the constraint that xi + • • • + = t and 

that there are no two consecutive Is. (This is the special case /1 = • • • = ^ = 1 of the 
previous exercise, but a much simpler construction is possible.) 

440. [MSS] Extend exercise 436 to context free languages, which can be defined by 
a set S C N and by production rules U and W of the following well-known forms: 

U C{P^a\Pe N, ae A} Siiid W C{P ^QR \ P,Q,Re iV}, where iV is a set of 

“nonterminal symbols •” A string xi ... x n with each Xj G A belongs to the language if 
and only if it can be produced from a nonterminal symbol P ^ S. 

441. [M35] Show that any threshold function f(xi ， … ^x n ) = [w\X\ + • • • + w n x n > t] 

has a forcing representation whose size is polynomial in log \wi \ H - + log \w n . 

► 442. [M27] The unit propagation relation hi can be generalized to kth order propa¬ 
gation h/, as follows: Let F be a family of clauses and let l be a literal. If (Zi ， Z 2 , … ， Ip) 
is a sequence of literals, we write L~ = {Zi ， … ， l q } for 1 < q < p- Then 


forcing 

Nondeterministic finite-state automata 

finite-state automata 

regular language 

input states 

output states 

transition rules 

forcing 

runs of Is 

regular expression 

consecutive Is 

context free languages 

production rules 

nonterminal symbols 

threshold function 

Pseudo-Boolean constraints, see threshold functi 
unit propagation 
propagation, 众 th order 
failed literals 


F ho l <=^ e G jP; 

F \~k+i I FI e, F\L^ h e, … ,and F\L~ h k e 

for some strictly distinct literals h ， Z‘ 2 , • • ” l P with l p = l] 

F 1-^ e F \- k l and F \-^1 for some literal 1. 


a) Verify that hi corresponds to unit propagation according to this definition. 

b) Describe h 2 informally，using the concept of “failed literals.” 

c) Prove that F e or F / implies F \ l hfc e for all literals and furthermore 
that F h/c e implies F hk-\-i e, for all k > 0. 

d) True or false: F \~k I implies F \-k+i l- 

e) Let L/ C (F) = {I \ F l}. What is Lk{R’\ where R r appears in ( 7 ) and k > 0? 

f) Given k > 1, explain how to compute L^(F) and F \ L^(F) in 0{n 2k ~ l m) steps ， 
when F has m clauses in n variables. 
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443. [M24] (A hierarchy of hardness.) Continuing the previous exercise, a family of 
clauses F is said to belong to class UC^ if it has the property that 

F\L h e implies F\L h k e for all sets of strictly distinct literals L. 

(“Whenever a partial assignment yields unsatisfiable clauses, the inconsistency can be 
detected by kth order propagation.”）And F is said to belong to class PC/c if 

F\L h l implies F\L \~k I for all sets of strictly distinct literals LU 1. 

a) Prove that PCo C UCo C PCi C UCi C PC 2 C UC *2 C ..., where the set 
inclusions are strict (each class is contained in but unequal to its successor) • 

b) Describe all families F that belong to the smallest class, PCo. 

c) Give interesting examples of families in the next smallest class, UCo- 

d) True or false: If F contains n variables 5 F G PC n . 

e) True or false: If F contains n variables, F G UC n -i- 

f) Where do the clauses R f of ( 7 ) fall in the hierarchy? 

444. [M26] The following single lookahead unit resolution algorithm, called SLUR ， 
returns either ‘sat’ ， ‘unsat’，or ‘maybe’，depending on whether a given set F of clauses 
is satisfiable, unsatisfiable, or beyond its ability to decide via easy propagations: 

El. [Propagate.] If F hi e, terminate (‘unsat’). Otherwise set F •<— F| {/ | F hi /}. 

E2. [Satisfied?] If _F = 0 ， terminate (‘sat’). Otherwise set l to any literal within F. 

E3. [Lookahead and propagate-] If F | / l/i e, set F <— F ⑴ {I’ \ F | / hi l r ] and 
return to E2. Otherwise if F | [ l/i F ^ F\l\{l f \ F\l hi l f } and return to E2. 
Otherwise terminate (‘maybe’). | 

Notice that this algorithm doesn’t backtrack after committing itself in E2 to either l or l. 

a) If F consists of Horn clauses，possibly renamed (see exercise 7.1.1-55), prove that 
SLUR will never return ‘maybe’，regardless of how it chooses l in step E2. 

b) Find four clauses F on three variables such that SLUR always returns ‘sat ’， 
although F is not a set of possibly renamed Horn clauses. 

c) Prove that SLUR never returns ‘maybe’ if and only if F G UCi (see exercise 443)• 

d) Explain how to implement SLUR in linear time with respect to total clause length. 

► 445 • [22] Find short certificates of unsatisfiability for the pigeonhole clauses ( 106 )- 
( 107 )，when they are supplemented by (a) ( 181 ); (b) ( 182 ); (c) ( 183 ). 

446. [M10] What’s the maximum number of edges in a subgraph of K m ^ n that has 
girth > 6 ? (Express your answer in terms of n).) 

► 447. [22] Determine the maximum number of edges in a girth -8 subgraph of Kg,8 - 

448. [M25] What is Z(m ， n) when m is odd and n = m(m —1)/6? Hint: See 6.5—( 16 )• 

449. [21] Exhibit n x n quad-free matrices that contain the maximum number of Is 
and obey the lexicographic constraints ( 185 ) ， ( 186 )， for 8 < n < 16. 

450 • [25] Prove that there is essentially only one 10 x 10 quad-free system of points 
and lines with 34 incidences- Hint: First show that every line must contain either 3 
points or 4 points; hence every point must belong to either 3 lines or 4 lines. 

► 451. [28] Find a way to color the squares of a 10 x 10 board with three colors，so that 
no rectangle has four corners of the same color. Prove furthermore that every such 
a nonchromatic rectangle^ board has the color distribution {34, 34, 32}，not {34, 33, 33}. 
But show that if any square of the board is removed, a nonchromatic rectangle is 
possible with 33 squares of each color. 
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452. [34] Find a nonchromatic rectangle with four colors on an 18 x 18 board- 

453. [M23] An mxn matrix X = (xij) is said to be decomposable if it has row indices 
R C {1 ， •… ， m} and column indices C C {1 ， •… ， n} such that 0 < |i?| + |(7| < m + n ， 
with Xij =0 whenever (i G R and j ^ C) or {i ^ R and j G (7). It represents a 
bipartite graph on the vertices {m^ • • • ， u m } and {m，• • • ， t? n }，if [ui —— Vj] = [x^ ^ 0]. 

a) Prove that X is indecomposable if and only if its bipartite graph is connected. 

b) The direct sum X’ ㊉ X" of matrices X’ and X", where X’ is m x n and X" is 
m n x n’’，is the {m + m") x (n’ + n") “block diagonal” matrix X that has X’ 
in its upper left corner ， X" in the lower right corner，and zeros elsewhere (see 
7 -( 40 )). True or false: If the rows and columns of X’ and X" are nonnegative and 
lexicographically ordered as in ( 185 ) and ( 186 ), so are the rows and columns of X. 

c) Let X be any nonnegative matrix whose rows and columns are lexicographically 
nonincreasing, as in ( 185 ) and ( 186 ). True or false: X is decomposable if and 
only if X is a direct sum of smaller matrices X r and X" • 

454. [15] If r is an endomorphism for the solutions of /， show that f(x)= f(xr) for 
every cyclic element x (every element that’s in a cycle of r). 

455• [M20] Suppose we know that ( 187 ) is an endomorphism of some given clauses F 
on the variables {a ： i ， x‘ 2 , 尤 3 , 尤 4 }. Can we be sure that F is satisfiable if and only if FAC 
is satisfiable，when (a) (7 = 124, i.e .，C = (xi V X 2 V X 4 )? (b) C = 234? (c) C = 123? 
(d) C = 134? 


456. [M21] For how many functions /(xi ， a ： 2 ,X 3 , 尤 4 ) is ( 187 ) an endomorphism? 

457. [HM19] Show that every Boolean f(xi^X 2 ^X 3 ^X 4 ) has more than 51 quadrillio: 
endomorphisms, and an n-variable function has more than 2 2 ( n_1 ). 


458• [20] The simplification of clauses by removing an autarky can be regarded as the 
exploitation of an endomorphism. Explain why. 


► 459. [20] Let Xij denote the submatrix of X consisting of the first i rows and the 
first j columns. Show that the numbers sweep (X^*) satisfy a simple recurrence, from 
which it’s easy to compute sweep(X) = sweep(X mn ). 

460. [21] Given m, n, A:，and r，construct clauses that are satisfied by an m x n binary 
matrix X = (x^) if and only if sweep(X) < k and ^. . Xij > r. 

,J 

461 • [20] What additional clauses will rule out non-fixed points of ti and T 2 ? 

462. [M22] Explain why n ， 乃 ， and T 3 preserve satisfiability in the sweep problem. 

► 463. [M21] Show that X is a fixed point of n, T 2 , and T 3 if and only if its rows and 
columns are nondecreasing. Therefore the maximum of vX = Xij over all binary 
matrices of sweep k is a simple function of m, n, and k. 

► 464. [M25] Transformations Ti and T 2 don’t change the text’s example 10x10 matrix. 
Prove that they will never change any 10 x 10 matrix of sweep 3 that has vX = 51. 

465. [M21] Justify the text’s rule for simultaneous endomorphisms in the perfect 
matching problem: Any perfect matching must lead to one that’s fixed by every 丁 U v 

466 . [M23] Prove that when mn is even, the text’s even-odd rule ( 190 ) for endomor¬ 
phisms of m x n domino coverings has exactly one fixed point. 


467. [20] Mutilate the 7x8 and 8x7 boards by removing the upper right and lower left 
cells- What domino coverings are fixed by all the even-odd endomorphisms like ( 190 )? 
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468. [20] Experiment with the mutilated chessboard problem when the even-odd 
endomorphisms are modified so that (a) they use the same rule for all i and j; or 
(b) they each make an independent random choice between horizontal and vertical- 

► 469. [M25] Find a certificate of unsatisfiability ((7i ，（ 7 ‘ 2 ,. •., Ct) for the fact that an 
8 x 8 chessboard minus cells ( 1 , 8 ) and ( 8 , 1 ) cannot be exactly covered by dominoes hij 
and Vij that are fixed under all of the even-odd endomorphisms. Each Ck ior 1 < k < t 
should be a single positive literal. (Therefore the clauses for this problem belong to 
the relatively simple class PC ‘2 in the hierarchy of exercise 443.) 

► 470. [M22] Another class of endomorphisms，one for every 4 一 cycle，can also be used in 
perfect matching problems: Let the vertices (instead of the edges) be totally ordered 
in some fashion. Every 4-cycle can be written vo —— vi —— V 2 —— V 3 —— vo, with 

> ^3 and vo > 仍 ； the corresponding endomorphism changes any solution for 
which vovi = V 2 V 3 = 1 by setting vovi <— V 2 V 3 <— 0 and V 1 V 2 <— V 3 V 0 <— 1- Prove that 
every perfect matching leads to a fixed point of all these transformations. 

471. [16] Find all fixed points of the mappings in exercise 470 when the graph is i^ 2 n- 

472. [M25] Prove that even-odd endomorphisms such as ( 190 ) in the domino covering 
problem can be regarded as instances of the endomorphisms in exercise 470. 

► 473. [M23] Generalize exercise 470 to endomorphisms for the unsatisfiable clauses of 
Tseytin’s graph parity problems in exercise 245- 

474. [M20] A signed permutation is a symmetry of f(x) if and only if f(x) = f(xcr) 
for all and it is an antisymmetry if and only if we have f(x) = f(xa) for all x. 

a) How many signed permutations of n elements are possible? 

b) Write 7514263 in cycle form, as an unsigned permutation of {1，•. • ， 7,1， . • • ， 7}. 

c) For how many functions f of four variables is 4132 a symmetry? 

d) For how many functions / of four variables is 4132 an antisymmetry? 

e) For how many /(m，• • • ^ 7 ) is 7514263 a symmetry or antisymmetry? 

475. [M22] Continuing exercise 474, a Boolean function is called asymmetric if the 
identity is its only symmetry; it is totally asymmetric if it is asymmetric and has no 
antisymmetries. 

a) If / is totally asymmetric, how many functions are equivalent to f under the op¬ 
erations of permuting variables, complementing variables, and/or complementing 
the function? 

b) According to (a) and 7.1.1—( 95 ), the function (x W y) A (x ® z) is not totally 
asymmetric. What is its nontrivial symmetry? 

c) Prove that if / is not asymmetric, it has an automorphism of prime order p. 

d) Show that if (uvw)(uvw) is a symmetry of /， so is {uv){uv). 

e) Make a similar statement if f has a symmetry of the form (uvwxy) (uvwxy). 

f) Conclude that, if n < 5, the Boolean function f(xi, • • • ， x n ) is totally asymmetric 
if and only if no signed involution is a symmetry or antisymmetry of f. 

g) However, exhibit a counterexample to that statement when n = 6 . 

476. [M23] For n < 5, find Boolean functions of n variables that are (a) asymmetric 
but not totally asymmetric; (b) totally asymmetric. Furthermore, your functions should 
be the easiest to evaluate (in the sense of having a smallest possible Boolean chain), 
among all functions that qualify- Hint: Combine exercises 475 and 477. 

► 477. [23] [Optimum Boolean evaluation.) Construct clauses that are satisfiable if and 
only if there is an r-step normal Boolean chain that computes m given functions gi. 
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•… ， 9m on n variables. (For example，if n = 3 and gi = 〈 X 1 X 2 X 3 〉， 夕 2 = xi ㊉ x ‘2 ㊉ 尤 3 , 
such clauses with r = 4 and 5 enable a SAT solver to discover a “full adder” of minimum 
cost; see 7.1.2—( 1 ) and 7.1.2—( 22 ).) Hint: Represent each bit of the truth tables- 


► 478. [23] Suggest ways to break symmetry in the clauses of exercise 477. 


► 479. [25] Use SAT technology to find optimum circuits for the following problems: 


a) Compute 2:2, zi^ and zo^ when xi + X2 + ^3 + X 4 = (2:22:12:0)2 (see 7-l-2-(27)). 

b) Compute 2:2, zi^ and zo^ when xi -h x2 + X3 + X 4 + X5 = (z2Zizo)2. 

c) Compute all four symmetric functions >So, Si, ^2, S3 of {xi^X2^ ^3}- 

d) Compute all five symmetric functions Sq, S\^ S2, Sa of {xi, 奶，尤 3, 尤 4}. 

e) Compute the symmetric function ^3 尤 4, x^^xq). 

f) Compute the symmetric function So^(xi^ ••” xq) = [(xi + • • • + xq) mod 4 = 0]. 

g) Compute all eight minterms of {x\^X2^xz} (see 7 . 1 . 2 — ( 30 )). 


480• [25] Suppose the values 0 ， 1, 2 are encoded by the two-bit codes xix r =00, 01 ， 
and 1*, respectively, where 10 and 11 both represent 2. (See Eq. 7.1.3—( 120 )•) 

a) Find an optimum circuit for mod 3 addition: ziz r = [xix r + yiVr) mod 3. 

b) Find an optimum circuit that computes ziz r = (xi + X 2 + ^3 + ViVv) mod 3- 

c) Conclude that [xi + • • • + 三 a (modulo 3)] can be computed in < 3n steps. 


► 481. [28] An ordered bit pair xy can be encoded by another ordered bit pair \xy\ = 
{x®y)y without loss of information, because \xy\ = uv implies {uv] = xy. 

a) Find an optimum circuit that computes ([ 之 2 /】 )2 = xi + X 2 + ^ 3 - 

b) Let r[m;]= (从 ㊉ i；) + t;，and note that "[00] = 0 ， p[01j = 2 ， ^[1*] = 1. Find 
an optimum circuit that, given x\ ... X 5 , computes 2 : 12 : 22:3 such that we have 
v{x\X 2 \ + v{x^xa\ + 尤 5 = 2iy[ziz 2 ] + 2 ： 3 - 

c) Use that circuit to prove by induction that the “sideways sum” (Zfj gn j • • • 之 1 之 0)2 = 
尤 l + $2 + • • • + x n can always be computed with fewer than 4.5n gates. 


482. [26] {Erdos discrepancy patterns.) The binary sequence yi .. .yt is called strongly 
balanced if we have Ej=i — 1) | <2 for 1 < k <t. 

a) Show that this balance condition needs to be checked only for odd k > 3. 

b) Describe clauses that efficiently characterize a strongly balanced sequence- 

c) Construct clauses that are satisfied by X 1 X 2 • • • if and only if XdX 2 d --- 
is strongly balanced for 1 < d < n. 


483. [21] Symmetry between colors was broken in the coloring problems of Table 6 
by assigning fixed colors to a large clique in each graph. But many graphs have no 
large clique，so a different strategy is necessary. Explain how to encode the “restricted 
growth string” principle (see Section 7-2.1.5) with appropriate clauses, given an ordering 
V 1 V 2 • •of the vertices: The color of Vj must be at most one greater than the largest 
color assigned to { 仍， … ，巧 — i}. (In particular, vi always has color 1.) 

Experiment with this scheme by applying it to the book graphs anna, david, 
homer, huck, and jean of the Stanford Graph Base. 


484. [22] ( Graph quenching.) A graph with vertices (t ； i，• • • ， v n ) is called “quenchable” 
if either (i) n = 1; or (ii) there’s a A: such that Vk —— v/c+i and the graph on 
(vi，• • • ， Vk-u r/c+i, ••” v n) can be quenched; or (iii) there’s an l such that vi —— vi^s 
and the graph on (n , vt—i, V1+3, 切 +1 , vi+2, 切 + 4 , •… ， v n ) can be quenched. 

a) Find a 4-element graph that is quench able although V 3 —j—v\. 

b) Construct clauses that are satisfiable if and only if a given graph is quenchable. 

Hint: Use the following three kinds of variables for this model-checking problem: 
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xt,ij = [vi —— Vj at time t]^ for 1 < i < j < n — t; qt,k = [a quenching move of type 
(ii) leads to time t + 1]; st,i = [a quenching move of type (iii) leads to time t + 1]. 

► 485 • [23] Sometimes successive transitions in the previous exercise are commutative: 
For example, the effect of qt,k and 仍 +i ， /c+i is the same as qt,k -{-2 and 仍 +i，/^ Explain 
how to break symmetry in such cases, by allowing only one of the two possibilities- 

486. [21] (Late Binding solitaire.) Shuffle a deck and deal out 18 cards; then try to 
reduce these 18 piles to a single pile, using a sequence of “captures” in which one pile is 
placed on top of another pile. A pile can capture only the pile to its immediate left, or 
the pile found by skipping left over two other piles. Furthermore a capture is permitted 
only if the top card in the capturing pile has the same suit or the same rank as the top 
card in the captured pile- For example, consider the following deal: 

J9 10* 80 J* A* K 秦 AG 4* 8 秦 5 秦 5 今 2 今 10 秦 A 秦 60 3G 10 今 

Ten captures are initially possible, including 5^ x JG, A 皋 x xlO 奉 ， and 5<0> x 5 秦 . Some 
captures then make others possible, as in 8 秦 x x K 秦 x x 8<J>. 

If captures must be made “greedily” from left to right as soon as possible, this 
game is the same as the first 18 steps of a classic one-player game called “Idle Year,” 
and we wind up with five piles [see Dick’s Games of Patience (1883) ， 50-52]. But if we 
cleverly hold back until all 18 cards have been dealt，we can do much better. 

Show that one can win from this position, but not if the first move is A 奉 x J^. 

► 487. [27] There are ( 6 g 4 ) = 4426165368 ways to place eight queens on a chessboard. 
Long ago, W- H. Turton asked which of them causes the maximum number of vacant 
squares to remain unattacked- [See W. W- Rouse Ball ， Mathematical Recreations and 
Problems^ third edition (London: Macmillan, 1896), 109—IKK] 

Every subset S of the vertices of a graph has three boundary sets defined thus: 

dS = the set of all edges with exactly one endpoint G S] 
doutS = the set of all vertices ^ S with at least one neighbor G S] 

d\ n S = the set of all vertices G S with at least one neighbor ^ S. 

Find the minimum and maximum sizes of dS, 0 O utS^ and d\ n S^ over all 8 - element sets 
in the queen graph Qs (exercise 7.1-4-241). Which set answers Turton’s question? 

► 488. [24] (Peaceable armies of queens.) Prove that armies of nine white queens and 
nine black queens can coexist on a chessboard without attacking each other, but armies 
of size 10 cannot, by devising appropriate sets of clauses and applying Algorithm C. 
Also examine the effects of symmetry breaking. (This problem has sixteen symmetries, 
because we can swap colors and/or rotate and / or reflect the board.) How large can 
coexisting armies of queens be on n x n boards, for n < 11? 

489. [M21] Find a recurrence for T n , the number of signed involutions on n elements. 

► 490. [15] Does Theorem E hold also when pip 2 … is any signed permutation? 

► 491. [22] The unsatisfiable clauses R in (6) have the signed permutation 2341 as an 

automorphism. How can this fact help us to verify their unsatisfiability? 

492. [M20] Let r be a signed mapping of the variables {xi^ • •., x n }] for example, 
the signed mapping c 4133^ stands for the operation (xi ， a> 2 , 尤 3 , 尤 4 ) = 

( 无 4 , 尤 1 ，尤 3 , 无 3 ). When a signed mapping is applied to a clause, some of the resulting lit¬ 
erals might coincide; or two literals might become complementary, making a tautology- 
When t = 4133, for instance, we have (123)r = 413, (134)r = 43, (134)r = p. 
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A family F of clauses is said to be “dosed” under a signed mapping r if Cr is 
subsumed by some clause of F whenever C G F. Prove that r is an endomorphism 
of F in such a case. 

493. [20] The problem waerden(3^ 3; 9) has four symmetries, because we can reflect 
and/or complement all the variables- How can we speed up the proof of unsatisfiability 
by adding clauses to break those symmetries? 

494. [21] Show that if (uvw)(uvw) is a symmetry of some clauses F, we^re allowed to 
break symmetries as if (uv)(uv)^ (uw)(uw)^ and (vw)(vw) were also symmetries- For 
example, if i < j < k and if (ijk)(ijk) is a symmetry，we can assert {xi W Xj) A (xj V Xk) 
with respect to the global ordering pi .. .p n = 1 • • • n. What are the corresponding 
binary clauses when the symmetry is (i) (ijk)(ijk)7 (ii) (ijk)(ijk)7 (iii) (ijk)(ijk)7 

495. [M22] Spell out the details of how we can justify appending clauses to assert ( 185 ) 
and ( 186 )，using Corollary E, whenever we have an m x n problem whose variables Xij 
possess both row and column symmetry. (In other words we assume that Xij »-> $(i 7 r)(jp) 
is an automorphism for all permutations 丌 of { 1 , … ， m} and p of { 1 ， … ， n}.) 

► 496. [M20] B. C- Dull reasoned as follows: “The pigeonhole clauses have row and col¬ 
umn symmetry. Therefore we can assume that the rows are lexicographically increasing 
from top to bottom, and the columns are lexicographically increasing from right to left. 
Consequently the problem is easily seen to be unsatisfiable.” Was he correct? 

497. [22] Use BDD methods to determine the number of 8 x 8 binary matrices that 
have both rows and columns in nondecreasing lexicographic order. How many of them 
have exactly r Is, for r = 24, r = 25, r = 64 — 25 = 39, and r = 64 — 24 = 40? 

498. [22] Justify adding the symmetry-breakers ( 183 ) to the pigeonhole clauses. 

499. [21] In the pigeonhole problem，is it legitimate to include the clauses ( 183 ) 
together with clauses that enforce lexicographic row and column order? 

500. [16] The precocious student J- H. Quick decided to extend the monkey wrench 
principle, arguing that if FoU S h l then the original clauses F can be replaced by F\ 1. 
But he soon realized his mistake. What was it? 

501 • [22] Martin Gardner introduced an interesting queen placement problem in Sci¬ 
entific American 235,4 (October 1976), 134—137: “Place r queens on an m x n chess¬ 
board so that (i) no three are in the same row, column, or diagonal; (ii) no empty square 
can be occupied without breaking rule (i); and (iii) r is as small as possible •” Construct 
clauses that are satisfiable if and only if there’s a solution to conditions (i) and (ii) with 
at most r queens- (A similar problem was considered in exercise 7.1.4—242.) 

502. [16] (Closest strings.) Given binary strings 5 i, • • ” 5 m of length n, and threshold 
parameters ri ， •… ， r m , construct clauses that are satisfiable by x = xi... x n if and 
only if x differs from Sj in at most rj positions, for 1 < j < m. 

503. [M20] (Covering strings.) Given Sj and rj as in exercise 502， show that every 
string of length n is within rj bits of some Sj if and only if the closest string problem 
has no solution with parameters rj = n — 1 — rj. 

► 504. [M21] The problem in exercise 502 can be proved NP-complete as follows: 

a) Let Wj be the string of length 2n that is entirely 0 except for Is in positions 2 j — 1 
and 2j, and let w n +j = Wj^ for 1 < j < n- Describe all binary strings of length 2n 
that differ from each of wi^ •… ， W 2 n in at most n bit positions. 

b) Given a clause (/1 V I 2 V I 3 ) with strictly distinct literals h,l 2 ,h G {xi ^. • •, x n ^ 
无 1 ， ••” x n }^ let y be the string of length 2n that is entirely zero except that it has 
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1 in position 2k — 1 when some U is Xk^ and 1 in position 2k when some U is Xk. 
In how many bit positions does a string that satisfies (a) differ from yl 

c) Given a 3SAT problem F with m clauses and n variables, use (a) and (b) to 

construct strings 5 i, of length 2n such that F is satisfiable if and only 

if the closest string problem is satisfiable with rj = n + [j > 2n]. 

d) Illustrate your construction in (c) by exhibiting the closest string problems that 
correspond to the simple 3SAT problems R and R r in ( 6 ) and ( 7 ). 

505. [21] Experiment with making Algorithm L nondeterministic, by randomizing the 
initial order of VAR in step LI just as HEAP is initialized randomly in step Cl. How does 
the modified algorithm perform on, say, problems D3, KO, and W2 of Table 6 ? 

506. [22] The weighted variable interaction graph of a family of clauses has one vertex 
for each variable and the weight ^ 2 /(|c|(|c| — 1 )) between vertices u and where the 
sum is over all clauses c that contain both 士乜 and 士认 Figure 52 indicates these weights 
indirectly, by making the heavier edges darker. 

a) True or false: The sum of all edge weights is the total number of clauses. 

b) Explain why the graph for test case B2 has exactly 6 edges of weight 2. What 
are the weights of the other edges in that graph? 

► 507. [21] (Marijn Heule.) Explain why “windfalls” (see ( 72 )) help Algorithm L to 
deal with miter problems such as D5. 

508. [M20] According to Table 7, Algorithm C proved problem T3 to be unsatisfiable 
after learning about 323 thousand clauses- About how many times did it enter a purging 
phase in step C7? 

509 • [20] Several of the “training set” tasks used when tuning Algorithm C’s param¬ 
eters were taken from the 100 test cases of Table 6 . Why didn’t this lead to a problem 
of “overfitting” （ namely，of choosing parameters that are too closely associated with 
the trainees)? 

510. [18] When the data points Al, A2, … ， X 8 were plotted in Fig. 55, one by one, 
they sometimes covered parts of previously plotted points, because of overlaps. What 
test cases are partially hidden by (a) T2? (b) X 6 ? (c) X7? 

511. [22] Problem P4 in Table 6 is a strange set of clauses that lead to extreme behav¬ 
ior of Algorithm C in Figs. 54 and 55; and it causes Algorithm L to “time out” in Fig. 53. 

a) The preprocessing algorithm of the text needs about 1.5 megamems to convert 
those 2509 clauses in 400 variables into just 2414 clauses in 339 variables- Show 
empirically that Algorithm L makes short work of the resulting 2414 clauses. 

b) How efficient is Algorithm C on those preprocessed clauses? 

c) What is the behavior of Walks AT on P4, with and without preprocessing? 

512. [29] Find parameters for Algorithm C that will find an Erdos discrepancy pattern 
X 1 X 2 • • • rapidly when n = 500. (This is problem E0 in Table 6 .) Then compare 
the running times of nine random runs with your parameters versus nine random runs 

with ( 194 )，when n = 400, 500, 600 ， … ， 1100 ， 1160, and 1161. 

513. [24] Find parameters for Algorithm L that tune it for rand (3, seed). 

514. [24] The timings quoted in the text for Algorithm W, for problems in Table 6 , 
are based on the median of nine runs using the parameters p = A and TV = 50n ， 
restarting from scratch if necessary until a solution is found- Those parameters worked 
fine in most cases, unless Algorithm W was unsuited to the task. But problem C9 was 
solved more quickly with p = .6 and N = 2500n (943 Mfi versus 9.1 Gfi). 

Find values of p and N/n that give near-optimum performance for problem C9- 
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► 515. [23] (Hard sudoku.) Specify SAT clauses with which a designer of sudoku puzzles 
can meet the following specifications: (i) If cell (i ， j) of the puzzle is blank，so is 
cell (10—i ， 10—j), for 1 < j < 9. (ii) Every row, every column, and every box contains 
at least one blank. (Here “box” means one of sudoku’s nine special 3x3 sub arrays*) 
(iii) No box contains an all-blank row or an all-blank column, (iv) There are at least 
two ways to fill every blank cell, without conflicting with nonblank entries in the same 
row, column, or box. (v) If a row, column, or box doesn’t already contain there are 
at least two places to put k into that row, column, or box, without conflict- (vi) If the 
solution has a 2 x 2 subarray of the form % , those four cells must not all be blank. 

(Condition (i) is a feature of “classic” sudoku puzzles. Conditions (iv) and (v) 
ensure that the corresponding exact cover problem has no forced moves; see Section 
7-2.2.1. Condition (vi) rules out common cases with non-unique solutions.) 

516. [MJf.9] Prove or disprove the strong exponential time hypothesis: “If 丁 < 2, there 
is an integer k such that no randomized algorithm can solve every /cSAT problem in 
fewer than r n steps, where n is the number of variables.” 


517. [25] Given clauses (7i，• • • ， C m , the one-per-clause satisfiability problem asks if 

there is a Boolean assignment x\ .. .x n such that every clause is satisfied by a unique 
literal- In other words, we want to solve the simultaneous equations = 1 for 

1 < j < m, where EC is the sum of the literals of clause C. 

a) Prove that this problem is NP-complete, by reducing 3SAT to it. 

b) Prove that this problem, in turn, can be reduced to its special case “one-in-three 
satisfiability,” where every given clause is required to be ternary. 

518. [M32] Given a 3SAT problem with m clauses and 


n variables，we shall construct a ( 6 m + n) x ( 6 m + n) 
matrix M of integers such that the permanent, per M, 123 
is zero if and only if the clauses are unsatisfiable. For 
example, the solvable problem ( 7 ) corresponds to the 
46 x 46 matrix indicated here; each shaded box stands 
for a fixed 6 x 6 matrix A that corresponds to a clause. 

Each A has three “inputs” in columns 1, 3, 5 and 
three “outputs” in rows 2, 4, 6 . The first n rows and the 
last n columns correspond to variables- Outside of the 
As, all entries are either 0 or 2 ; and the 2s link variables 
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to clauses, according to a scheme much like the data 
structures in several of the algorithms in this section: 

Let Iij and Oij denote the jth input and output of clause for 1 < i < m and 1 < j < 3. 
Then, if literal l appears in t > 0 clauses ii < ••• < it, as element ji, … ， jt, we put ‘2’ 
in column Ii k+1 j k+1 of row Oi k j k for Q S k S t (O《 0 j is row |Z| ， Ii t+1 j is column 6 m + |/|). 
a) Find a 6 x 6 matrix A = (ay)，whose elements are either 0, 1, or —1, such that 
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Hint: There’s a solution with lots of symmetry. 

b) In which of the rows and columns of M does occur twice? once? not at all? 

c) Conclude that per M = 2 4 m + n s, when the 3SAT problem has exactly s solutions. 
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519. [20] Table 7 shows inconclusive results in a race for factoring between factor—fifo 
and factorJifo. What is the comparable performance of factor-rand (m ， n ， z, 314159)? 


520. [24] Every instance of SAT corresponds in a natural way to an integer program¬ 
ming feasility problem: To find, if possible, integers xi^ ^ x n that satisfy the linear 
inequalities 0 < Xj < 1 for 1 < j < n and 


/1 + + • • • + 4 > 1 for each clause (7 = (Zi V Z ‘2 V • • • V Ik). 


For example, the inequality that corresponds to the clause (xi V X 3 V X 4 V X 7 ) is 

Xl + (1—X3) + (I—X 4 ) + X7 > 1 ； i-e., Xl — X 3 — X4 ^ X7 > —1. 

Sophisticated “IP solvers” have been developed by numerous researchers for solv¬ 
ing general systems of integer linear inequalities, based on techniques of “cutting 
planes” in high - dimensional geometry* Thus we can solve any satisfiability problem 
by using such general-purpose software, as an alternative to trying a SAT solver. 

Study the performance of the best available IP solvers, with respect to the 100 
sets of clauses in Table 6 , and compare it to the performance of Algorithm C in Table 7. 

521 • [30] Experiment with the following idea, which is much simpler than the clause¬ 
purging method described in the text: “Forget a learned clause of length k with 
probability pk，’’ where pi > p *2 > P 3 > is a tunable sequence of probabilities. 

► 522. [26] (Loopless shadows.) A cyclic path within the 
cube P 3 □ ^3 □ P 3 is shown here, together with the three 
“shadows” that appear when it is projected onto each co¬ 
ordinate plane- Notice that the shadow at the bottom 
contains a loop, but the other two shadows do not. Does 
this cube contain a cycle whose three shadows are entirely 
without loops? Use SAT technology to find out- 

523• [30] Prove that，for any m or n, no cycle of 
the graph P m D P n □ P 2 has loopless shadows. 

► 524. [22] Find all Hamiltonian paths of the cube 
P 3 □ ^3 □ P 3 that have loopless shadows. 

► 525. [40] Find the most difficult 3SAT problem you can that has at most 100 variables. 

526. [M25] (David S. Johnson ， 1974.) If F has m clauses, all of size > prove that 
some assignment leaves at most m/2 k clauses unsatisfied. 

999• [MOO] this is a temporary exercise (for dummies) 
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SECTION 7.2.2.2 

1 . (a) 0 (no clauses). (b) {e} (one clause, which is empty)- 

2. Letting 1 lazy, 2 happy, 3 ― > unhealthy, 4 dancer, we’re given the 
respective clauses {314,142, 342, 243,132, 231,143}, matching R f in ( 7 ). So all known 
Pincusians dance happily, and none are lazy. But we know nothing about their health. 
[And we might wonder why travelers have bothered to describe so many empty sets.] 

3. f(j - l ， n) + f(k - l ， n)，where f(p,n) = ^ =1 (n - pd) = p{? 2 ) + q(nmodp) « 
n 2 /( 2 p), if we set q = \n/p \. 

4. Those constraints are unsatisfiable if and only if we remove a subset of either 
{357,456,357,456}, {246,468,246,468}, {246,357,468,456}, or {456,246,357,468}. 

5. No polynomial upper bound for W (3, k) is currently known. Clearly W (3, k) is 
less than VF(3, k), the minimum n that guarantees either three equally spaced Os or A: 
consecutive Is. An analysis by R- L. Graham in Integers 6 (2006) ， A29:l—A29:5, beefed 
up by a subsequent theorem of T. H. Bloom in arXiv: 1405.5800 [math.NT] (2014), 
22 pages, shows that VF(3, k) = exp 0 (A:(log A:) 4 ). 

6 . Let each Xi be 0 with probability p = (2\nk)/k^ and let n be at most k 2 /(In A:) 3 . 
There are two kinds of “bad events ”： a set of three equally spaced Os，occurs 
with probability P = p 3 ; and a set of k equally spaced ls，occurs with probability 
P r = (1 — p) k < exp(—kp) = 1/A: 2 - In the lopsidependency graph, which is bipartite, 
each Ai is adjacent to at most D = 3k 3 /((k — l)(lnA:) 3 ) nodes A f j] each A f j is adjacent 
to at most d = |A: 3 /(ln k) 3 nodes Ai. By Theorem we want to show that，for all 
sufficiently large values of k, P < y(l _ x) D and P f < x{l — y) d ^ for some x and y. 

Choose x and y so that (1 — x) D = 1/2 and y = 2P. Then x = ©((log k) 3 /k 2 ) and 
y = ©((log k) 3 /k 3 ); hence (1 — y) d = e^p(-yd-\-0(y 2 d)) = 0(1). [See T. Brown, B. M. 
Landman，and A. Robertson, J. Combinatorial Theory A115 (2008) ， 1304—1309.] 

7. Yes，for all n, when X 1 X 2 X 3 … = 001001001 •… 

8 . For example, let Xi^ a signify that Xi = for 1 < i < n and 0 < a < 6 . The relevant 
clauses are then 而 ， 0 V • • • V Xi^ b -i for 1 < i < n; and Xi^ a V x i+d ,a V • • • V x i+ ^ ka ^ 1 ) d ^ a , 
for 1 < i < n — (k a — l)d and d > 1. Optionally include the clauses Xi^ a V 屯 〆 for 
0 < a < a f < b. (Whenever the relevant clauses are satisfiable，we can also satisfy the 
optional ones by falsifying some variables if necessary.) 

[V. Chvatal found W (3,3,3) = 27. Kouril’s paper shows that W (2,4, 8 ) = 157, 
W(2,3,14) = 202, W(2,5,6) = 246, W(4,4,4) = 293, and lists many smaller values.] 

9. W (2, 2, k) = 3k — (2, 0, 2, 2,1 ， 0) when k mod 6 = (0, 1 ， 2, 3,4, 5). The sequence 
2 /c "" 1 02 /c "' 1 12 /c "" 1 is maximal when A: 丄 6 ; also 2 /e "" 1 02 /c_ 1 12 /c ~ 3 when A: mod 6 = 3; 
also 2 k ~ 1 02 k ~ 2 12 k ~ 1 when k mod 6 = 4; otherwise 2 k ~ 1 Q2 k ~ 2 12 k ~ 2 . [See B. Landman, 
A. Robertson，and C- Culver, Integers 5 (2005), A10:l—A10:ll, where many other 
values of VF(2, • • • ， 2, A:) are also established.] 

10. If the original variables are {1， ••” n}，let the new ones be {1 ， … ， n}U{l’ ， … ， n’}. 
The new problem has positive clauses {11 ，， … ， nn，}. Its negative clauses are, for 
example, 2 f 679 f if 2679 was an original clause- The original problem is equivalent 
because it can be obtained from the new one by resolving away the primed variables- 

[One can in fact construct an equivalent monotonic problem of size 0{m + n) in 
which (xi V • • • V Xk) is a positive clause if and only if (xi V • • • V Xk) is a negative 
clause. Such a problem, “not-all - equal SAT,” is equivalent to 2-colorability of hyper¬ 
graphs. See L. Lovasz, Congressus Numerantium 8 (1973) ， 3-12; H. Kleine Biining and 
T. Lettmann, Propositional Logic (Cambridge Univ. Press, 1999), §3.2, Problems 4-8.] 
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11* For each variable i，the only way to match vertices of the forms ij and ij" is to 
choose all of its true triples or all of its false triples. 

For each clause j, the vertex pairs {j’2,/3} ， {j’4 ， j’5}，define three 
“slots ”； hence two of the vertices {wj ， xj ， yj ， zj} must be matched into the same slot. 
Furthermore we can’t have two in one slot and two in another, because the remaining 
slot would then be unmatched. Thus two of the Ij vertices are matched in their slot ， 
while the other is matched with /I， whenever we have a perfect matching. 

Conversely, if all clauses are satisfied, with Ik true in clause j, there always are 
exactly two ways to match l^j with jl while matching wj, xj, yj, zj, and the other 
two Ij vertices with j’2, . • • ， j’7. (It’s a beautiful construction! Notice that no vertex 
appears in more than three triples.) 

12. Equation ( 13 ) says Si(yi,... ,y p ) = S>i(yi,... ,y p ) A S<i(yi,... ,y p ). If p < 4, 

use Ai<j<k<p(yj v Vk) for S<i(yi,... ,y p ); otherwise S<i(yi,... ,y p ) can be encoded 
recursively via the clauses 1 / 2 , ？ / 3 , 尤)八 (f ， 2 / 4 , • • •, ?/ P ), where tis a new variable. 

[This method saves half of the auxiliary variables in the answer to exercise 7.1.1—55(b).] 

Note: Langford’s problem involves primary columns only; in an exact cover prob¬ 
lem with nonprimary columns, such columns only need the constraint • • •, y p ). 

13. (a) Si(xi^X2^X3^X4^X 5 ^X6) A Si(x7^X 8 ^X9^Xio^Xn) A Si(xi2^Xi3) A 5i(Xi4 ， Tl5, 
X 1 6 ) ASi (X1^X7^XV2^X14) ASi (X2^X 8 ^X13^X1 5 ) ASi (xi^X 3 ^Xq^Xi 6 ) ASi ( 尤 2 ，為， X7^Xio) A 
Si (X3^X5^X 8 ^X11^X12) A 51 (x 4 ^ 6 ^ 9 ^ 13 ^ 14 ) A 51 (x 5 ^ 10 ^ 15 ) A Si (X 6 ^X 11 ^X 1 6 )^ 

(b) Duplicate clauses occur when rows intersect more than once. We avoid them 
if we simply generate clauses XiM Xj for every pair (i ， j) of intersecting rows. 

(c) When langford (4) is generated in this way，it has 85 distinct clauses in 16 vari¬ 
ables, namely (a：i Va ；2 VX 3 VX 4 VX 5 Va ： 6 ) A (a ： 7 VxsVxgVxioVxn) A- • • A (xeVxn Va ： i 6 ) A 

(X 1 WX 2 ) A (X 1 WX 3 ) 八 … 八 {xi^\Jxiq). 

But langford’ （ 4：) cannot use the trick of (b). It has 85 (nondistinct) clauses in 20 
variables, beginning with 123456, 12, 13, lT, 23, 21 f , 31 7 , l x 4, 1 ; 5, l^, 45, 46, 56, …， 
if we denote the auxiliary variables by 1’ ， 2’， ••… Two of those clauses (13 and 46) are 
repeated. (Incidentally, langford’ （ 12 、has 1548 clauses, 417 variables, 3600 cells.) 

14. (Answer by ML Heule.) Those clauses sometimes help to focus the search. For 
example, if we’re trying to color the complete graph K n with n colors (or pigeons), we 
don’t want to waste time trying V 2 = 1 when vi is already 1- 

On the other hand, other instances of SAT often run slower when redundant clauses 
are present，because more updates to the data structures are needed. 

We might also take an opposite approach, and replace ( 17 ) by nd clauses that 
force every color class to be a kernel (See exercise 21.) Such clauses sometimes speed 
up a proof of uncolorability- 

15. There are N = n(n+l) vertices (j, k) for 0 < j < n and 0 < A; < n. If (j, k) = (1,0) 

we define k) —— (n 5 i) for x < i < where x = L n /2」. Otherwise we define the 
following edges: (j,k ) —— （j + 1 ， A: + 1) if j < n and k < n — 1] (j, k) —— (j + 1, k) if 
j < n and j ^ k; (j, k) —— (j, k + 1) if k < n — 1 and j ^ k 1; (j, k) —— (n, n — 1) if 

j = 0; (j, k ) —— (n — j, 0) if A: < n — 1 and j = k; (j, k ) —— (n + 1 — j, 0) if j > 0 and 

j = k; (j, k ) —— (n - J, n - j - 1) if A: = n - 1 and 0 < j < A:; (j, k ) —— (n + 1 — j, n - j) 

if A: = n — 1 and Q < j < n. Finally ， (0, 0) —— (1 ， 0)，and (0, 0) —— (n ， i) for 1 < i < x. 

That makes a grand total of 3N — 6 edges (as it should in a maximal planar graph, 
according to exercise 7-46). 

16. There’s a unique 4-clique when n > 5, namely {(0，n — 2) ，（ 0，n — 1) ，（ l，n — 1 )， 
(n，n — 1)}. All other vertices，except (0, 0) and (1,0)，are surrounded by neighbors 
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that form an induced cycle of length 4 or more (usually 6 ) - [See J.-L Laurier ，Artificial 

Intelligence 14 (1978), 117.] 

17. Let mcgregor(n) be the clauses ( 15 ) and ( 16 ) for the graph. Add clauses ( 19 )，for 
symmetric threshold functions to bound the number of variables v\ for color 1 ; the kth 
vertex Xk can be specified by the ordering in answer 20. Then if, for instance, we can sat¬ 
isfy those clauses together with the unit clause , where N = n(n + l )， we have proved 
that f (n) < r • Similarly，if we can satisfy them together with 5 ^ , we have proved that 
g{n) > r. Additional unit clauses that specify the colors of the four clique vertices will 
speed up the computation: Four cases should be run, one with each clique vertex receiv¬ 
ing color 1, If all four cases are unsatisfiable，weVe proved that /(n) > r or g(n) < r, 
respectively- Binary search with different values of r will identify the optimum. 

For speedier g(n), first find a maximum independent set instead of a complete 
4-coloring; then notice that the colorings for f(n) already achieve this maximum. 

The results turn out to be /(n) = (2 ， 2 ， 3 ， 4 ， 5 ， 7 ， 7 ， 7 ， 8 ， 9 ， 10 ， 12 ， 12 ， 12) for 
n = (3,4,, 16), and g{n) = (4, 6 , 10, 13, 17, 23, 28, 35, 42, 50, 58, 68 , 77, 88 ). 

18. Assuming that n > 4, first assign to vertex (j,k) the following “default color ”： 
1 + (j + k) mod 3 if j < k; 1 + (j + A: + 1 — n) mod 3 if A: < j/2; otherwise 1 + (j + 
k + 2 — n) mod 3. Then make the following changes to exceptional vertices: Vertex 
(1 ， 0) is colored 2 if n mod 6 = 0 or 5, otherwise 3. Vertex (n, n — 1) is colored 4. For 
A: l 0 up to n — 2， change the color of vertex (n ， A:) to 4, if its default color matches 
vertex (0,0) when k < n/2 or vertex (1 ， 0) when k > n/2. And make final touchups 
for 1 < j < n/2, depending again on n mod 6: 

Case 0: Give color 4 to vertex (2j，j — 1) and color 1 to vertex (2j + l ， j). 

Case 1: Give color 4 to vertex (2j ， j) and color 2 to vertex (2j + l ， j). 

Case 2: Give color 4 to vertex (2j ， j) and color 1 to vertex (2j + 1 ， j). Also give 

(n，n — 2) the color 1 and (n — 1, n — 3) the color 4. 

Cases 3, 4 ， 5: Give color 4 to vertex (2j + l,j). 

For example, the coloring for the case n = 10 (found by Bryant) is shown in Fig. A—5(a). 
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Fig. A— 5. Colorings and kernels of McGregor’s graph. 


The color distribution is ([n 2 /3j, [n 2 /3\^ [n 2 /3\^5k) + ((0, 1 ， A: ， —1) ，（ 1 ， A: ， 1 ， 0 )， 
(—1, A:+ 1,1, 2), (0,A:,1,2), (1, A:+ 1,1, 2), (0,2, A:+ 1,3)), for n mod 6 = (0,1,2, 3,4,5), 

k = L n / 6 」• Since this construction achieves all of the optimum values for /(n) and g(n), 
when n < 16, it probably is optimum for all n. Moreover, the value of g{n) agrees with 
the size of the maximum independent set in all known cases. A further conjecture is 
that the maximum independent set is unique, whenever n mod 6 = 0 and n > 6 . 
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19. Use the clauses of mcgregor 、 n), together with (vi V 仍 V 仍 V 〜) 八 (m V 仍 V 说 V 〜 ) 八 
(vi W V 3 W va\/ v x ) /\ {v 2 \/ V 3 W V 4 W v x ) for each vertex, together with clauses from ( 20 ) 
and ( 21 ) that require at least r of the vertices v x to be true. Also assign unique colors 
to the four clique vertices- (One assignment, not four, is sufficient to break symmetry 
here, because h{n) is a more symmetrical property than /(n) or g{n).) These clauses 
are satisfiable if and only if h{n) > r. The SAT computation goes faster if we also 
provide clauses that require each color class to be a kernel (see exercise 21 ). 

The values /i(n) = (1 ， 3, 4, 8 , 9 ， 13) for n = (3,4, ••• ， 8 ) 
are readily obtained in this way. Furthermore, if we extend 
color class 4 in the construction of answer 18 to a suitable ker¬ 
nel, we find h(9) > 17 and /i(10) > 23. The resulting diagram 
for n = 10, illustrated in Fig. A-5(b), nicely exhibits 2 23 so¬ 
lutions to McGregor’s original coloring problem, all at once. 

A good SAT solver also shows that /i(9) < 18 and /i(10) < 

23, thus proving that /i(10) = 23. And Armin Biere’s solver 
proved in 2013 that /i(9) = 18, by discovering the surprising 
solution shown here. (This exercise was inspired by Frank Bernhart, who sent a diagram 
like Fig. A—5(b) to Martin Gardner in 1975; his diagram achieved 2 21 solutions.) 

20. Arrange the vertices (j, k) of answer 15 in the following order vo^vi^ ... : (n，n — 1); 

(0，n - 1), (0,n - 2)，, (0,0); (l,n — 1), (l,n - 2), , (1,1); ; (n - 2,n - 1), 

(n-2,n-2); (n-l,n-2), (n-2,n-3), (2,1); (n - l,n- 1); (2,0), (3,1),, 

(n,n-2); (3,0), (4,1), (n,n-3); (1,0); (4, 0), ..., (n,n-4); (n-1,0), (n, 1); 

(n, 0). Then if Vt = {vo, ••” vt-i}^ let the “frontier” G consist of all vertices G Vt that 
have at least one neighbor ^ Vt. We can assume that ( 你，仍， 仍 ） are colored (0 ， 1,2), 
because they are part of the 4-clique. 

All 4-colorings of Vt that have a given sequence of colors on Ft can be enumerated 
if we know the corresponding counts for Ft-i. The stated ordering ensures that Ft 
never will contain more than 2 n — 1 elements; in fact, at most 3 2n - 2 sequences of colors 
are feasible, for any given t. Since 3 18 is less than 400 million, it’s quite feasible to do 
these incremental calculations. The total (obtained with about 6 gigabytes of memory 
and after about 500 gigamems of computation) turns out to be 898,431 ， 907,970,211. 

This problem is too large to be handled efficiently by BDD methods when n = 10, 
but BDD calculations for n < 8 can be used to check the algorithm. The frontiers essen¬ 
tially represent level-by-level slices of a QDD for this problem. The 4-coloring counts 
for 3 < n < 9 are respectively 6 , 99, 1814, 107907, 9351764, 2035931737, 847019915170. 



21. With one Boolean variable v for every vertex of a graph the kernels are 
characterized by the clauses (i) u\/ v whenever u —— v; (ii) v V \J U _ v u for all v. 
Adding to these the clauses for the symmetric threshold function S< r (xi^ ••” xn), we 
can find the least r for which all clauses are satisfiable. The graph of Fig. 33 yields 
satisfiability for r = 17; and one of its 46 kernels of size 17 is shown in Fig. A—5(c). 

[BDD methods are slower for this problem; but they enumerate all 520,428,275,749 
of the kernels, as well as the generating function 46 之 17 +471802： 18 + • • • + 317 之 34 -\-2z So .] 

12771 

22. Eight colors are needed. The coloring ■盟雲 is “balanced,” with each color used 


at least thrice. 


34655 

14451 


23. Writing k for Xk and J for Sj^ the clauses from (i 8 )-(ig) are } \ \ 3 ^ ?! ， 11 ， ？ 2 , 

33 44 4 4 T 1 9 1 o 1 o 1 2 o I 2 7 12 o 2 3 7 2 3 ^23 734 r 3 4 ^34 r 4 ^4 十 4 

2 3 5 1 2 5 2 3^ 1 l5 5 °35 15 °2 2^ ^33^ 1^ ^2 2 5 ° 3 3 ^ ^1 1^ ° 2 2 ^ u 33^ ° 1 ^ u 2^ { 3 - 


Similarly ， ( 20 ) and ( 21 ) define the clauses 7®, 6® 5 67 2 ； 招塁； 23 ^； 

T 3 63 6 T 3 63 6 T 3 4 5 2 4 2 5 2 4 2 4 5 2 4 5 2 4 5 2 4 5 2. 2 3 2 3 2 3 

1 15 1 15 1 1 2 5 2 2 5 2 1 3 ， 1 1 5 1 15 2 2 ， 2 2 5 1 1 2 5 12 3^ 2135 2 2 4 ， 41 ， 3 2 ， 2 3 • 
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this tree-based method apparently needs one more variable and two more clauses when 
(n 5 r) = (7,4). But the next exercise shows that ( 18 ) and ( 19 ) don’t really win! 

24. (a) The clause ( 6 f V bl) appears only if = r; and < n/2. 

(b) For example ， 尤 3 = min(r ， 4) < r when n = 11 and r = 5. 

(c) In this case h is the number of leaves below node and the only auxiliary 
variables that survive pure literal elimination are bt k . We^re left with just n—1 surviving 
clauses, namely (b^ k V b^ L + + \ V for 1 < A: < n, plus (b 2 t2 V 6 ? 3 ). 

(d) If 2 k < n < 2 k - 2 k ~ l we have {n\n") = (n — 2 fc_ 1 , 2 k ~ 1 ); on the other hand 
if 2 k + 2 fc_1 < n < 2 fc+1 we have (n’ ， n") = (2 fc , n — 2 k ). (Notice that n" <n< 2n".) 

(e) No pure literals are removed in this completely balanced case (which is the 
easiest to analyze). We finda(2 fc ,2 /c ~ 1 ) = (k-l)2 k and c(2 fc ,2 /c_1 ) = (2 &— 2 + A: —1)2' 

(f) One can show that a(n ， r) = (r < n"? 6 (n’ ， r) + b(n\r): r < nl b(n\n f, ) + 

b(n\n n ): b(n’ ， n—r)+b(n",n—r)), where 6(1 ， 1)=0 and6(n ， r) = r+ 6 (n , ， min(r ， n , )) + 
6 (n’’,min(r,n’’)) for n > 2. Similarly, c(n, r) = (r < n ’’？ r + /(n’,0,r) + /(n’’,0,r): 
r < n? n" + f(n 、 r- rz", r*) + f(n",0, n")\ n-r-\- f (n’ ， r — n\ n) -f- f(n"，r — n’ 
where f(n,l,r) = ^^ =/+ 1 min(A:-f-l, n’’+l, n-\-l — k)-\-(r < n"l r+/(n / , 0 , r)+/(n // , 0 , r): 
r < n? n" -\- /(n 7 , 0 , r) H- r < n? n — r + f(n,0, n) + /(n^, 0 , n")\ 

r — l < n" 1 f(n\n — r +1, n)-\- f{n", n" — r + /, n")\ r — l < nl f(n\ n — r + /, n) -\- 
/(n", 0, n")\ f(n’,0,n’）+ /(n", 0, n’’)) for n > 2 and /(I ， 0, 1) = 0. The desired results 
follow by induction from these recurrence relations. 

Incidentally, ternary branching can give further savings. We can, for example, 
handle the case n = 6 , r = 3 with 17 clauses in the 6 variables 6 璧， 6 赢， 6 璧， 6 置 . 

25. From ( 18 ) and ( 19 ) we obtain 5n — 12 clauses in 2n — 4 variables, with a simple 
lattice-like structure* But ( 20 ) and ( 21 ) produce a more complex tree-like pattern, with 
2n — 4 variables and with L n /2」nodes covering just two leaves. So we get L n /2」nodes 
with 3 clauses, n mod 2 nodes with 5 clauses, |"n/2] nodes with 7 clauses, and 2 clauses 
from ( 21 ), totalling 5n — 12 as before (assuming that n > 3). In fact, all but n — 2 of 
the clauses are binary in both cases- 


26. Imagine the boundary conditions 5 ° = 1, s^ +l = 0 

1 < k < r. The clauses say that and that 

follows by induction on j and k. 


so 


0, for 1 < j < n — r and 


k 


< so the hint 


Setting j = n — r and A: = r +1 shows that we cannot satisfy the new clauses when 

xi H -+ x n > r + 1- Conversely，if we can satisfy F with xi + • • • + < r then we 

can satisfy ( 18 ) and ( 19 ) by setting 5 ^ ^ [xi + • • • + Xj+k-i > A:]- 


27. Argue as in the previous answer, but imagine that 6 q = 1, = 0; prove the 

hint by induction on j and n — k (beginning with k = n — 1, then k = n — 2, and so on). 


28. For example, the clauses for xi + • • • + < n — 1 when n = 5 are (xi V s })， 

(X 2 V 5 } V sl)^ (X 3 V 5 ? V 5i), {xa V 5? V 5i), (a：5 V 5i)- We may assume that n > 4; 
then the first two clauses can be replaced by (xi V X 2 V and the last two by 
(x n -i W x n W yielding n — 2 clauses of length 3 in n — 3 auxiliary variables. 


29. We can assume that 1 < ri < • • • < r n = r < n. Sinz’s clauses ( 18 ) and ( 19 ) actu¬ 
ally do the job nicely if we also assert that Sj is false whenever A: = n +1 and j = i — n. 

30. The clauses now are (sj\/ 5j +1 ), {xj^k V 5j V 5》 +1 ), ( 5 》 V 吞 》 +1 ), (^j+k V 5 ^ V 5j +1 ), 
hence they define the quantities 5 》 =[xi + • • • + Xj^k-i > 众 ]; implicitly Sq = s r - +l = 0 

and s°j = s^_ r+1 = 1. The new clauses in answer 23 are } \, \ \ \, 22 ， 22 ， 2 2 ^ 3 3 5 

2 3 3 4 . T I 92 q 3 A 4 ol I q 2 2 A 3 3 ^4 4 o 1 I a 2 2 3 ^4 4 aI r 2 ^3 7 4 

3 3 5 33 ， 1 1 5 z 1 5 ° 1 5 ^ 1 5 z 1 2 5 ° 1 2 5 ^12 5 ° 1 2 ^ ° 2 3 5 ^ 2 3 5 ° 2 3 ^ u 2 3 5 ^ 3 5 ° 3 5 u 3 5 { 3- 
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recurrence relations 
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23456789 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 

2 4 5 9 11 13 14 20 24 26 30 32 36 40 41 51 54 58 63 71 74 82 84 92 95100 

2 3 5 6 8 9 10 13 15 17 19 21 23 25 27 28 30 33 34 37 40 43 45 48 50 53 

2 3 4 6 7 8 9 11 12 13 14 16 17 18 19 24 25 27 28 29 31 33 34 36 37 38 

2 3 4 5 7 8 9 10 12 13 14 15 17 18 19 20 22 23 24 25 26 29 32 33 35 36 


Furthermore, significant speedup is possible if we also make use of previously 
computed values Ft(1), ••” Ft(r-l). For example, when t = 3 and r > 5 we must have 

x a +iH - hx a +8 < 4 for 0 < a < n — 8, because ^(5) = 9. These additional subinterval 

constraints blend beautifully with those of exercise 30, because x a +i + • • • + x a +p < q 
for 0 < a < n — p implies s^ +p _ q V s^ q for0<6<n+l—p + ^ — r and q < k <r. 


We can also take advantage of left-right symmetry by appending the unit clause 
彡 d /21 when r is odd; 一 " 2+1 when n and r are both even. 

Suitable benchmark examples arise when computing, say ， ^(27) or ^(36) - But for 
large cases, general SAT-based methods do not seem to compete with the best special- 
purpose backtrack routines- For example, Gavin Theobald and Rodolfo Niborski have 
obtained the value ^(41) = 194, which seems well beyond the reach of these ideas. 

[See P- Erdos and P. Turan, J. London Math. Soc. (2) 11 (1936) ， 261-264; errata ， 
34 (1959), 480; S. S. Wagstaff, Jr., Math. Comp. 26 (1972), 767-771.] 

32. Use ( 15 ) and ( 16 ), and optionally ( 17 )，but omit variable Vj unless j G L{v). 

33. To double-color a graph with k colors，change ( 15 ) to the set of k clauses 仍 V • • • V 
Vj-i V 巧 +1 V v/c, for 1 < j < k; similarly，(^) clauses of length k — 2 will yield a triple 
coloring. Small examples reveal that C 2/+1 for / > 2 can be double-colored with five 
colors: {1 ， 2}({3, 4}{5,1}) Z 一 1 {2, 3}{4, 5}; furthermore, seven colors suffice for triple 

coloring when l > 3: {1,2, 3}({4, 5,6}{7,1,2}) /_ 2 {3,4, 5}{6, 7,1}{2, 3,4}{5, 6 ,7}. The 
following exercise proves that those colorings are in fact optimum. 

34. (a) We can obviously find a g-tuple coloring with qx{G) colors- And McGregor’s 
graph has a four - clique, hence %*((^) > 4. 

(b) Any g-tuple coloring with p colors yields a solution to the fractional exact cover 
problem, if we let Aj = ^2^ =1 [Sj is the set of vertices colored i]/q- Conversely，the 
theory of linear equalities tells us that there is always an optimum solution with rational 
{Ai ， …， 入 at}; such a solution yields a g-tuple coloring when each is an integer. 


6 3 
2 3 5 


4 2 
2 4 


5 2 
2 4 


4 2 4 5 2 5 2 4 5 2 4 5 2 
1 35 2 2 3 5 1 35 12 2 5 2 12 


4 5 


2 ， 45 f; 3 

2 3 2 3 2 3 

4 1 5 3 2 5 2 3 


3 

3 ， 


With ( 20 ) and ( 21 ) we can identify bj with if k+1 一 j, when 4 > 1 leaves are below 
node k. Then bj is true if and only if the leaves below k have j or more Is* For 
example, answer 23 gets the new clauses 7 看， 6 置， 67f; 5 2 5 4 2 , 45 1 ; 3\, 23?; 1 

1 6 3 63 1 6 3. 

1 2 2 5 1 2 5 1 1 1 5 

Furthermore, ( 20 ) and ( 21 ) can be unified in the same way with the weaker 
constraints r < x\+— • + x n < r. If we want, say，2 < xi + • y +X 7 < 4, we can simply 
replace the final four clauses of the previous paragraph by 1 J 1 , . ?, \ Under the con¬ 
ventions of ( 18 ) and ( 19 )，by contrast，these weaker constraints would generate a compa¬ 


rable number of new clauses, namely 


2 


2 


2 


2 


1 15 2 2 5 3 3 5 4 4 


， K and 1 ! ， 2 ^， 35 ! ， 3H 4! 


2 

3, 


4 , 534 , 5 4 5 5 645 , 65 , 7 5 ; but those clauses involve the new variables 4 


4 ， 5 


2 

4 


2 

5 


31. We can use the constraints on the second line of ( 10 ), together with the constraints 
of exercise 30 that force xi + • • • + x n = r. Then we seek n for which this problem is 
satisfiable, while the same problem with = 0 is not. The following small values can 
be used to check the calculations: 
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(c) x^{C n ) = x(C n ) = 2 when n is even; and x*(C 2 /+i) <2 + 1/1 = n/a(C^+i )， 
because there’s an /-tuple coloring with n colors as in the previous exercise- Also 

> n/a(G) in general: n = J2 v Ej i v ^ s j] = Wjl < a ( G ) A i- 

(d) For the hint，let S = {vi^... ^vi} where vertices are sorted by their colors. 
Since vertex Vj belongs to Ci with \Ci\ > |{巧. ，…， 切}|， we have t Vj < 1/(/ + 1 — j ). 

Sox ⑹ <k = ^2 v t v =^2 v t v < Yjj\ H a{G)- 

[See David S. Johnson, J. Computer and System Sci. 9 (1974) ， 264-269; L. Lovasz, 
Discrete Math. 13 (1975) ， 383—390. The concept of fractional covering is due to A. J. W. 
Hilton, R. Rado, and S- H. Scott ， Bull. London Math. Soc. 5 (1973), 302-306.] 


35. (a) The double coloring below proves that x* (G) > 7/2; and it is optimum because 
NV and its neighbors induce the wheel Wq. (Notice that x*(Wn) = 1 + x*(Cn-i)0 

(b) By part (c) of the previous exercise, x*((7) > 25/4. Furthermore there is a 
quadruple coloring with 25 colors: 


AEUY ABUV BCVW CDWX DEXY 
AEFJ ABFG BCGH CDHI DEIJ 
FJKO FGKL GHLM HIMN IJNO 
KOPT KLPQ LMQR MMRS NOST 
PTUY PQUV QRVW RSWX STXY 



[Is C 5 13 C 5 the smallest graph for which x*(G) < x{G) — 1?] 


36. A few more binary color constraints analogous to ( 16 ) yield the corresponding SAT 
problem. We can also assume that the upper right corner is colored 0， because that 
region touches n + 4 = 14 others; at least n + 6 colors are needed. The constraints 
elsewhere aren’t very tight (see exercise 38(b)); thus we readily obtain an optimum 
radio coloring with n + 6 colors for the McGregor graphs of all orders n > 4, such as 
the one below- An (n + 7)th color is necessary and sufficient when n = 3 or 4. 




37. The 10-coloring shown here is optimum, because Missouri (MO) has degree 8 . 

38. By looking at solutions for n = 10, say, which can be obtained quickly via Algo¬ 
rithm W (Walks AT), it’s easy to discover patterns that work in general: (a) Let (x^y) 
have color (2x + Ay) mod 7. (Seven colors are clearly necessary when n > 3.) (b) Let 
(x,y, z) have color (2x + Qy) mod 9. (Nine colors are clearly necessary when n > 4.) 

39. Let /(n) denote the fewest consecutive colors- SAT solvers readily verify that 
/(n) = (1 ， 3, 5, 7, 8 , 9) for n = (0, 1 ， 2, 3, 4, 5). Furthermore we can exploit symmetry to 
show that f(6)> 10: One can assume that 000000 is colored 0, and that the colors of 
000001, …， 100000 are increasing; that leaves only three possibilities for each of the 
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latter. Finally，we can verify that /(6) = 11 by finding a solution that uses only the 
colors {0,1,3,4, 6,7,9,10}. 

But /(7) is known only to be > 11 and < 15- 

[L(2,1) labelings were named by J. R- Griggs and R. K. Yeh, who initiated the 
theory in SIAM J. Discrete Math. 5 (1992) ， 586-595. The best known upper bounds ， 
including the fact that f(2 k — k — 1) < 2 k ^ were obtained by M. A. Whittlesey ， J. P. 
Georges, and D. W- Mauro, who also solved exercise 38(a); see SIAM J. Discrete Math. 
8 (1995), 499-506.] 

40. No; the satisfiable cases are 2 ： = 0, 1 ， 2, 3, 4, 5, 6, 7, 8, 9, 10 ， 12 ， 14 ， 15, 21. [The 
statement would have been true if we^d also required (x m V … Va> 2 ) 八 (y n V • • • Vy‘ 2 ).] 

41. First there are mn ANDs to form xiyj. A bin that contains t bits initially will 
generate |_ 尤 /2」 carries for the next bin，using (t — 1)/2 adders- (For example, t = 6 will 
invoke 2 full adders and one half adder.) The respective values of t for bin [2], 6m [3], 

• •” bin [m + n + 1] are (1 ， 2, 4 ，6， … ， 2m — 2 ， 2m — 1 ， … ， 2m — 1 ， 2m — 2 ， 2m — 3, 

• • • ， 5, 3 ， 1), with n — m occurrences of 2m — 1- That makes a total mn — m — n full 
adders and m half adders; altogether we get mn + 2(mn — m — n) + m instances of 
AND, mn — m — n instances of OR, and 2(mn — m — n) + m instances of XOR- 

42. Ternary XOR requires quaternary clauses, but ternary clauses suffice for median: 


(tWuW v\/ x) 
(tWuW v\/ x) 
[iWuW vW x) 
(iWuW v\/ x) 


(tWuW v\/ x) 
(tWuW v\/ x) 
(tW uW vW x) 
(tWuW v\/ x) 


(tWuW y) (tW u\/ y) 

(tW vW y) (tW v\/ y) 

(u\/ v\/ y) {u\/ v\/ y) 


These clauses specify respectively that x < > 尤 ㊉ 从 ㊉ t;，y < (tuv)^ y > (tuv). 

43• x = y = 3 works when n = 2, but the cases 3 < n < 7 are unsatisfiable. We can 
use x = 3(2 n " 2 + 1)，y = 7(2 n _ 3 + 1) for all n > 8. (Such solutions seem to be quite 
rare. Another is x = 3227518467, y = 3758194695 when n = 32.) 

44. First scout the territory quickly by looking at all ( A ^) ^ 660 billion cases with at 
most six zeros in x or y; here N = (■■) + (■》） + … + ( 亞 ) • This uncovers the remarkable 
pair x = 2 32 - 2 26 - 2 22 - 2 11 - 2 8 - 2 4 - 1, ? / = 2 32 - 2 11 + 2 8 - 2 4 +1, whose product is 
2 64 — 2° 8 — 2 54 — 2 44 — 2 33 — 2 8 — 1. Now a SAT solver finishes the job by showing that 
the clauses for 32 x 32 bit multiplication are unsatisfiable in the presence of the further 

constraint x\ H - h 无 32 + 沒 i H - h 沒 32 + 乏 i H - h 乏 64 < 15 - (The LIFO version of the 

clauses worked much faster than FIFO in the author’s experiments with Algorithm L. 
Symmetry was broken by separate runs with .. .x\ — 01 &— 1 ，yk ••• yi = 1^-) 

45• Use the clauses for xy = z in the factorization problem, with m = [t/2\^ n = \t/2\^ 
and Xj = yj for 1 < j < m; append the unit clause (y n ) if m < n. 

46. The two largest, 285000288617375 2 and 301429589329949 2 , have 97 bits; the next 
square binary palindrome, 1178448744881657 2 , has 101- [This problem is not easy for 
SAT solvers; number theory does much better. Indeed, there’s a nice way to find all 
n-bit examples by considering approximately 2 n / 3 cases, because the rightmost n/3 
bits of an n/2-bit number x force the other n/6 bits, if x 2 is palindromic. The first 
eight square binary palindromes were found by G. J. Simmons, J. Recreational Math. 
5 (1972), 11-19; all 31 solutions up to 2 90 were found by J. Schoenfield in 2009.] 

47. Each wire has a “top” and a “bottom.” There are n J rg J r2h tops of wires, and 
m + 2g + h bottoms of wires. Hence the total number of wires is n + ^ + 2/i = m + 2p + /i, 
and we must have n + h = m + g. 
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48 . The wires compute q 1 ^ q 2 ^ x p ® q l ^ y ^ q 2 ® z x ® y. Let p 
denote u p stuck at 1 ” while p denotes u p stuck at O' The pattern pqr = 000 detects 
p, q 1 , g 2 , r, x ， y ， 之； 001 detects p ， q 1 ^ g 2 , f ， 仏乏； 010 detects p, q 1 , 豆 2 , r, x, 勺， z] Oil 
detects p, q 1 ^ 豆 2 , r, x, y, z\ 100 detects 歹 , g 1 ， q 2 , t, x, y, z 、101 detects g 1 , q 2 , f, x, 
z] 110 detects 歹，夺， $ ，厂，尤，仏乏 ； HI detects p, 4 4 , 〒， x, y, z. Notice that the 

stuck-at faults for q aren’t detectable (because 2 ： = (p ® g) ® (g ® r) = p ® r); but we 
can detect faults on its clones g 1 , q 2 . (In Fig. 34 the opposite happens.) 

Three patterns such as { 100 , 010 , 001 } suffice for all of the detectable faults- 


49 . One finds，for example, that the faults 6 赢， 續， 5 2 ，and q are detected only by the 
pattern ysy2yiX2Xi = 01111; a 2 , a 3 , b 3 , p, c 2 , Z5 are detected only by 11011 or 11111. 

All covering sets can be found by setting up a CNF with 99 positive clauses, one 
for each detectable fault; for example, the clause for ^5 is X27 V X31, while the clause 


for x\ is V X5 V Xv2 V x\z V X20 V X21 V X28 V X29 - We can find minimum covers from 
a BDD for these clauses, or by using a SAT solver with additional clauses such as (20) 
and (21) to limit the number of positive literals- Exactly fourteen sets of five patterns 
suffice, the most memorable being { 01111 , 10111 ， 11011 , 11101 , 11110 }- (Indeed, every 
minimum set includes at least three of these five patterns.) 


50 . Primed variables for tarnished wires are x’ 2 , h、 2 , U 、 p’, g’, z f 3 ^ c( 2 , z f 5 . Those 
wires also have sharped variables 6 鑑 ，…， ^|; and we need sharped variables xl 2 \ x s 2 \ 
b 2 2 \ bl\ 5 lft , 5 2Jt , c.^, c 2 } for fanout wires. The primed variables are defined 
by clauses such as {p r V a^) A {p r V b r 2 ) A (p f V «3 V 5 ’ 2 ), which corresponds to p f •<— 03 A 62. 
Those clauses are appended to the 49 clauses listed after (23) in the text- Then there 
are two clauses (25) for nine of the ten primed-and-sharped variables; however, in the 
case of X2 we use the unit clauses (x’2) 八 ( 无 2) instead，because the variable x\ doesn’t 
exist. There are five fanout clauses (26)，namely \/ xp V x^) A (6| V 6^ V 6^) A • • • A 
{c\ V 0^2 Vc^). There are eleven clauses \/b\)/\(x^ V63) A (6^ Vs#) 八 … 八(皆 V^) A 
(c^ V z\) for tarnished inputs to gates. And finally there’s (x^) A {z\ V 2:4 V z^). 


51 . (The complete set of 196 patterns found by the author in 2013 included the inputs 
(x,y) = (2 32 — 1,2 31 + 1) and ( 「 2 63 / 2 "|, [2 63 / 2 ]) as well as the two number-theoretic 
patterns mentioned in the text. Long runs of carries are needed in the products.) 


52 . (2:1,2 V2:2,2 V - - - V zm,2) A (zi^Vqi^i) A ( 为 ,2V 色 ,2) 八 ( 为， 2V& ， 3 ) 八（右 ， 2 V 货， 4) 八…八 
( 右 ，2 V 兩， 20 )， for 1 < i < M• The second subscript of z is k in the kth case, 1 < k < P. 


53 . On the left is the binary expansion of 丌， and on the right is the binary expansion 
of e ， 20 bits at a time (see Appendix A). 

One way to define f(x) for all 20 -bit x is to write 7r/4 = Uk/^ 20k and 

e /4 = vi/ 2 201 ^ where each Uk and vi is a 20 -bit number. Let k and l be smallest 

such that x = Uk and x = vi. Then f{x) = [A: < /]. 

Equation (27) has actually been contrived to sustain an illusion of magic: Many 
simple Boolean functions are consistent with the data in Table 2 , even if we require four- 
term DNFs of three literals each. But only two of them, like (27), have the additional 
property that they actually agree with the definition of f(x) in the previous paragraph 
for ten more cases, using Uk up to A: = 22 and vi up to Z = 20 ! One might almost begin 
to suspect that a SAT solver has discovered a deep new connection between 7r and e. 

54 . (a) The function ^1X93:11 无 18 V XQX10X12 V X4X10 无 12 matches all 16 rows of Table 2 ; 
but adding the 17 th row makes a 3 -term DNF impossible. 

(b) 21 rows are impossible，but (27) satisfies 20 rows. 
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(c) x\x^xi 2 xi 7 y xaxsxi^xi^Wxqxqxv 2 X\qWxqxi^xiqX 2 o^Jxi^xiaxiq does 28, which 
is max. (Incidentally, this problem makes no sense for sufficiently large M，because the 
equation f(x) = 1 probably does not have exactly 2 19 solutions.) 

55. Using ( 28 )—( 31 ) with pij = 0 for all i and j, and also introducing clauses like ( 20 ) 
and ( 21 ) to ensure that %,i + ••• + %,‘ 2 o S 3, leads to solutions such as 

/(Xl, •…, X 20 ) = X 1 X 7 X 8 V X 2 X 3 X 4 V X4X13X14 V X 6 ^ 10 ^ 12 - 


(There are no monotone increasing solutions with < 4 terms of any length.) 

56 • We can define f consistently from only a subset of the variables if and only if no 
entry on the left agrees with any entry on the right, when restricted to those coordinate 
positions. For example, the first 10 coordinates do not suffice，because the top entry on 
the left begins with the same 10 bits as the 14th entry on the right. The first 11 coordi¬ 
nates do suffice (although two entries on the right actually agree in their first 12 bits). 

Let the vectors on the left be Uk and vi as in answer 53, and form the 256 x 20 
matrix whose rows are Uk ㊉ 的 for 1 < l < 16 - We can solve the stated problem if 
and only if we can find five columns for which that matrix isn’t 00000 in any row- This 
is the classical covering problem (but with rows and columns interchanged) : We want 
to find five columns that cover every row- 

In general, such an m x n covering problem corresponds to an instance of SAT 
with m clauses and n variables where Xj means “select column j.” The clause for 
a particular row is the OR of the Xj for each column j in which that row contains 1- 
For example, in Table 2 we have m ㊉ vi = 01100100111101111000, so the first clause 
is x ‘2 V X 3 V $6 V • • • V xi 7 . To cover with at most five columns, we add suitable clauses 
according to ( 20 ) and ( 21 ); this gives 396 clauses of total length 2894, in 75 variables- 

(Of course ( 2 5 °) is only 15504; we don’t need a SAT solver for this simple task! 
Yet Algorithm D needs only 578 kilomems, and Algorithm C finds an answer in 353 K/n.) 

There are 12 solutions: We can restrict to coordinates Xj for j in {1,4, 15,17, 20 }， 
{1,10,15,17,20}, {1,15,17,18,20}, {4,6,7,10,12}, {4,6,9,10,12} ， {4,6,10,12,19}, 
{4, 10,12,15, 19} ， {5, 7,11 ， 12, 15} ， {6, 7, 8 ,10, 12} ， { 6 , 8 , 9,10, 12} ， {7, 10,12,15, 20}，or 
{8, 15,17,18, 20}- (Incidentally, BDD methods show that the number of solutions to the 

covering problem has the generating function 12z b -\-994z 6 + 135032： 7 H - \-20z 19 + z 20 ^ 

counting by the size of the covering set.) 

57 . Table 2 specifies a partially defined function of 20 Boolean variables, having 2 20 —32 
“don’t-cares •” Exercise 56 shows how to embed it in a partially defined function of only 
5 Boolean variables, in twelve different ways. So we have twelve different truth tables: 


11110110 0 * 1 * 010 * 10000111 10 * 0 * 1*0 
011 * 011 * 1*110100 10 * 001*1 1000**10 
011 * 1*11 010 * 100 * 10 * 0*000 * 101*011 
10101110 0 * 100 * 1 * 1 * 001*00 1 氺氺 00 氺氺氺 
10101110 0 * 1 * 0*10 1 * 0 * 1*00 0 ** 01 *** 
1 * 01110 * 00 ** 110 * 11 ** 0*00 10*****0 


00100101 11110 * 0 * 1011 **** ** 0 ** 00 * 
100 * 1**0 11*00010 1100 ** 0 * * 0**0101 
** 1*1000 1*101100 1 * 100*10 0 ***** 1 * 
1 * 1 * 1*10 10001100 0 * 101 * 1 * ** 1 * 0*10 
1 * 01 * 00 * 1101 * 0*0 0011 * 11 * 1 * 100 * 0 * 
001*1001 * 1 ** 1 * 1 * 11 * 0*010 01011001 


And the tenth of these yields f(x) = (( 尤 8 ㊉ ($9 V xio)) V (( 尤 6 V ri‘ 2 ) ㊉ 无 io)) ㊉ ^ 12 . 

58. These clauses are satisfiable whenever the other clauses are satisfiable (except in 
the trivial case when f{x) = 0 for all x)^ because we don’t need to include both Xj and 
Xj in the same term. Furthermore they reduce the space of possibilities by a factor of 
(3/4 ) a . So they seem worthwhile- (On the other hand, their effect on the running time 
appears to be negligible，at least with respect to Algorithm C in small-scale trials.) 


cardinality constr 
covering problem 
BDD 

generating function 
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59 . f (x) ㊉ f (x) = $2 无 3 无 6 无 lo 无 12( 无 8 Vx8(a ： i3 Va ： i5)) is a function of eight variables that 
has 7 solutions* Thus the probability is 7/256 = . 02734375 . 

60 . A typical example with 32 given values of f(x), chosen randomly，yielded 

A 

f(Xi ， … ^X2o) = X4X7X12 V X6X8X11X14X20 V X9X12X18X19 V X13X16X17 尤 19 , 

which of course is way off; it differs from f(x) with probability 102752 / 2 18 ^ . 39 . With 
64 training values, however ， 

A 

/(XI， • • • ^X2o) = T 2 X 13 X 15 T 19 V ^ 3 ^ 9 ^ 19^20 V X6X10 无 12 V ^8^10^12 

comes closer, disagreeing only with probability 404 / 2 11 ^ . 197 . 

61 . We can add 24 clauses (p a ,i V q a ,i V p a ,2 V q a ;i V p a ，3 V q a ,3 V • • • V Pb ， iVq b ,i V • • • V 
Vc,\N q c ,\M • • •Vpj， 1 V 如 ，： l V. • • Vpd ， 10 V^， 10 V. • • Vpdjo V 似， 20)，one for each permutation 
abed of { 1 , 2 , 3 , 4 }; the resulting clauses are satisfiable only by other functions f(x). 

But the situation is more complicated in larger examples, because a function can 
have many equivalent representations as a short DNF. A general scheme, to decide 
whether the function described by a particular setting and of the ps and qs is 
unique, would be to add more complicated clauses, which state that pij and qi，j give 
a different solution. Those clauses can be generated by the Tseytin encoding of 

M N M N 

V 八((负 ， jA%)Vd 為 •)）© V 八傾 ,j A%) v (q id Axj)). 

i=l j = l i = l j = l 

62 . Preliminary experiments by the author, with TV = 20 and p = 1 / 8 , seem to 
indicate that more data points are needed to get convergence by this method, but the 
SAT solver tends to run about 10 times faster. Thus, locally biased data points appear 
to be preferable unless the cost of observing the hidden function is relatively large- 

Incidentally, the chance that x( k ) = x( k_1 ) was relatively high in these experiments 
(( 7 / 8 ) 20 » . 069 ); so cases with y ⑻ =Q were bypassed- 

63 . With Tseytin encoding (24) ， it’s easy to construct 6r + 2 n — 1 clauses in 2 r + 2 n — 1 
variables that are satisfiable if and only if a fails to sort the binary sequence xi ... x n . 
For example, the clauses when a = [ 1 : 2 ][ 3 : 4 ][ 1 : 3 ][ 2 : 4 ][ 2 : 3 ] are (xiWli) A (X2V/1 ) 八 
(xiVx 2 V/i) A(xiV/ii) A(x 2 V_/ii) A(xiVx 2 V/ii) A - • • A(/ 4 V/" 5 ) A(/i3V[ 5 ) A([ 4 V/i3Vj5) A 
(/ 4 V/i 5 )A 03 V/15) 八 (/ 4 V/i 3 V/i5)A (giWg 2 \/gs)/\(gi V/ 3 )A (giWl 5 ) A feV/ 5 )A (g 2 \/h 5 ) A 
( 歹 3V/i5) 八 (^3V/i4) - They’re unsatisfiable, so a always sorts properly* 

64 . Here we reverse the policy of the previous answer, and construct clauses that are 
satisfiable when they describe a sorting network: Let the variable Cjj stand for the 
existence of comparator [i:j] at time ^ for 1 < i < j < n and I < t <T. Also, adapting 
(20) and (21), let variables B^ k be defined for 1 < j < n —2 and 1 < k < n ，with clauses 

B 2 j +1 ， k ) f\ ^j,k) ^ B》, k ) /\ Bj^ k )\ (*) 

in this formula we substitute {C\^ k ^ ••” C t k __ 1 ^ k ^C t k ^ k+1 ^ •… ， C^， n } for the n — 1 “leaf 
nodes” {B t n _ l ^ k ^... ^ ^2 n _ 3?/c }. These clauses prohibit comparators from clashing at 
time ^ and they make B^ k false if and only if line k remains unused. 

If x = xi • • • is any binary vector, let yi … y n be the result of sorting x (so that 
(yi … y n 、2 = 2 UX — 1 ). The following clauses F(x) encode the fact that comparators Cjj 

transform x ^ y: VT^,^/ V ^~ 1 ) A V V^V V ^~ 1 ) V^ 1 ) A 

(cij V 1 v v ^~ 1 ) a (cij v vlj vv^Aicl^vl, 1 ) /\ ⑻， ,vF^vi^-^a 
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for 1 < i < j < n and 1 < t < T; here we substitute Xj for V^j and 
also substitute yj for V^j 5 thereby simplifying the boundary conditions. 

Furthermore, we can remove all variables V^i when x has i leading Os and V^j 
when x has j trailing ls，replacing them by 0 and 1 respectively and simplifying further. 

Finally，given any sequence a = [ii :ji ].. .[i r : jV] of initial comparators, T further 
parallel stages will yield a sorting network if and only if the clauses (*)，together with 
/\ x F(x) over all x producible by a，are simultaneously satisfiable. 

Setting n = 9, a = [1:6][2:7][3:8][4:9]，and T = 5， we obtain 85768 clauses 
in 5175 variables, if we leave out the ten vectors x that are already sorted. Al¬ 
gorithm C finds them unsatisfiable after spending 

A 

roughly 200 megamems; therefore T(9) > 6. (Algo¬ 
rithm L fails spectacularly on these clauses, how¬ 
ever.) Setting T l 6 quickly yields T(9) < 7. 

D. Bundala and J. Zavodny [LNCS 8370 (2014), 

236-247] used this approach to prove in fact that 
T(ll) = 8 and f (13) = 9. Then T. Ehlers and 
M. Muller extended it [arXiv: 1410.2736 [cs.DS] 

(2014)，10 pages]，to prove that 宁 (17) = 10， with 
the surprising optimum network shown here. 

65. (a) The goal is to express the transition equation in CNF* There are (^) clauses like 

(x \/ x a y Xb\/ x c y Xd)^ one for each choice of four neighbors {a, 6, d] C {nw, n ， …， se}. 

Also ( 零 ） clauses like \J x a M … M Xg), one for each choice of seven. Also (g) like 

[x 1 MxMx a ^J - .Vxf), for each choice of six. Also ( 重 ) like (x^XaVxbVxcVxdV- • Vx/J ， 

complementing just three. And finally (®) like (x 7 V x V V V V • • • V x g )^ 

complementing just two and omitting any one of the others. Altogether 70 + 8 + 28 + 

56 + 28 = 190 clauses of average length (70 - 5 + 8- 8 +28- 8 +56- 9 +28 - 9)/190 ^ 7-34. 

(b) Here we let x = Xij, x N w = • • •, x SE = x = x\j. There 

# • # • 

are seven classes of auxiliary variables , … ， g]^, each of which has two children; 
the meaning is that the sum of the descendants is > k. We have k G {2, 3, 4} for the 
a variables 5 k G {1 ， 2, 3, 4} for the b and c variables，and k G {1 ， 2} for d ， e ， /， 仏 



The children of are and c u . The children of are cp 心一 and 

W+Q&2)). The children of c u are /〆’ and , where i' = i-\-2 and j' = (j — 1) 11 if i is 
odd, otherwise i = i and / = j — (j&l). The children of d lJ are X(^_i)(j + i) and 
The children of e u are ； l) and The children of f 1 」are and 

Finally, the children of g 2J are x^j and where % = i + 1 —((i&l) 《 l); 

and (〆’,/’）= (i + 1， j ㊉ 1) if i is odd, otherwise (〆’,/’）= (i — 1， j — 1 + ((j & 1) 《 1 )). 
(OK —— this isn’t elegant. But hey, it works!) 

If the children of p are q and r, the clauses that define pk are (pk V V f^//) for 
k f + k n = k and (pk V V r^n) for k r + k n = A: + 1- In these clauses we omit or fo ； 

we also omit q m or r m when q or r has fewer than m descendants. 

For example, these rules define df 5 and d^ 0 by the following six clauses: 

(dfv 免 26), ⑷ 35 V 元 36), (4 5 V 元 26 V 无 36), (J? 5 Vx 26 V 尤 36), (JfVx 26 ), (,V 灼 6 ). 

♦ 參 # 參 ♦籲 

The variables b 1 ^ are defined only when i is odd; d 1 ^ and e 2 ^ only when i is odd and 
j mod 4 < 2; only when i + j is even. Thus the total number of auxiliary variables 
per cell ignoring small corrections at boundary points, is 3 + 4/2 + 4 + 2/4 + 2/4 + 
2/2 + 2 = 13 of types a through not 19, because of the sharing; and the total number 

of clauses per cell to define them is 21 + 16/2 + 16 + 6/4 + 6/4 + 6/2 + 6 = 57, not 77. 
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Finally we define x\j from a?/ ， ag ， a 2 ^ by means of six clauses 

{x'ijWai), (xijVxijWa^), (x ^ Va^ Va^), «V%V_), Va z 4 J Va^), 

where yij is another auxiliary variable (introduced only to avoid clauses of size 4). 

66. All solutions to (a) can be characterized by a BDD of 8852 nodes, from which we 
can obtain the generating function 38 之 28 +5502： 29 + • • • + 150z 41 that enumerates them 
(with a total computation time of only 150 megamems or so). Part (b), however, is 
best suited to SAT, and Xq must have at least 38 live cells. Typical answers are 






67. Either ^ or ^ at lower left will produce the Xq of ( 37 ) at time 1. But length 22 
is impossible: With r = 4 we can verify that all the live cells in X 4 lie in some 3x3 
sub array. Then with r = 22 we need to rule out only ((^) + (!) + (【))x 6 = 2016 
possibilities，one for each viable X 4 within each essentially different 3x3 sub array- 


68. The author believes that r = 12 is impossible, but his SAT solvers have not yet 
been able to verify this conjecture- Certainly r = 11 is achievable, because we can 
continue with the text’s fifth example after prepending 




69• Since only 8548 essentially different 4x4 bitmaps are possible (see Section 7.2.3 )， 
an exhaustive enumeration is no sweat. The small stable patterns arise frequently, so 


they Ve all been named: ^ 

bee- 

(a) block tub boat ship snake hive 

m SS S S S S 


long long 

carrier barge loaf eater boat ship pond 

S IlBJ 1 I 1 1 ■ LffJ I ■TT1 UPO 

lOij ■ \_m pOhj puu ■ i ■ 

I \ m\ rm 1 mmjj 


(b) 


blinker 

困 


clock 

11 ■ 1 1 ■ n 


toad 


beacon 

Ho 国 


(A glider is also considered to be stable, although it’s not an oscillator.) 


70 . (a) A cell with three live neighbors in the stator will stay alive- 

(b) A 4 x n board doesn’t work; Fig. A -6 shows the 5x8 examples. 

(c) Again, the smallest-weight solutions with smallest rectangles are shown in 
Fig. A- 6 . Oscillators with these rotors are plentiful on larger boards; the first examples 
of each kind were found respectively by Richard Schroeppel (1970)，David Buckingham 
(1972), Robert Wainwright (1985). 

71 . Let the variables Xt = Xijt characterize the configuration at time ^ and suppose 
we require X r = Xo. There are q = 8r automorphisms a that take Xt X^ t+p ^ mod r 丁， 
where 0 < p < r and r is one of the eight symmetries of a square grid. 

Any global permutation of the N = n 2 r variables leads via Theorem E to a 
canonical form, where we require the solution to be lexicographically less than or equal 
to the q — 1 solutions that are equivalent to it under automorphisms. 

Such lexicographic tests can be enforced by introducing (g — 1)(37V —2) new clauses 
of length <3, as in ( 169 ) — and often greatly simplified using Corollary E. 

These additional clauses can significantly speed up a proof of unsatisfiability. On 
the other hand they can also slow down the search, if a problem has abundant solutions. 
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In practice it’s usually better to insist only on solutions that are partially canonical, 
by using only some of the automorphisms and by requiring lexicographic order only on 
some of the variables. 

72. (a) The two 7 x 7s, shown in Fig. A— 6, were found by R. Wainwright (trice tongs ， 
1972) and A* Flammenkamp (jam, 1988). 

Van de trice 

Omega Graaff J3 genie copter tongs jam spinners infinity 

圓 ia 圍圍圖圍圍 __ 

Fig. A—6. Noteworthy minimal oscillators of periods 2 and 3. 
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quilt patterns 


(b) Here the smallest examples are 9 x 13 and 10 x 15; the former has four L - rotors 
surrounding long stable lines- Readers will also enjoy discovering 10 x 10 and 13 x 13 
instances that have full eightfold symmetry. (When encoding such symmetrical prob¬ 
lems by using exercise 65(b), we need only compute the transitions between variables 

xtij for 1 < i < [m/2] and 1 < j < \n/2\] every other variable is identical to one of 

# • # • 

these. However, the auxiliary variables a 13 , … ， g 2J shouldn’t be coalesced in this way-) 
(c ， d) Champion heavyweights have small rotors. What a cool four-way snake dance! 


■: itivj 

innilllllinn 

130/240 « .54 132/256 « .52 

73. (a) They don’t have three A neighbors; and they don’t have three B neighbors. 

(b) Two examples appear in Fig. A—7, where they are packed as snugly as possible 
into a 12 x 15 box. This pattern, found by R. W. Gosper about 1971， is called the 
phoenix, since its living cells repeatedly die and rise again* It is the smallest mobile 
flipflop; the same idea yields the next smallest (also seen in Fig. A—7)，which is 10 x 12. 

(c) The nonblank one comes from a 1 x 4 torus; the checkerboard from an 8 x 8. 
Here are some amazing m x n ways to satisfy the constraints for small m and n: 



120/225 ^ .53 



120/225 ^ .53 136/256 ^ .53 



Notice that infinite one - dimensional examples are implied by several of these motifs; 
the checkerboard, in fact, can be fabricated by placing 


A 

A 



|b 


B 



A 


A 



B 


B 


diagonals together. 
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Fig. A— 7. Mobile flipflops: An ideal way to tile the floor of a workspace for hackers. 


74. Call a cell tainted if it is A with more than one A neighbor or B with more than 
one B neighbor. Consider the topmost row with a tainted cell, and the leftmost tainted 
cell in that row- We can assume that this cell is an A ， and that its neighbors are S ， T ， 
U, V, W, X ， Y, Z in the pattern H- Three of those eight neighbors are type B，and 


at least four are type A; several cases need to be considered. 


Case 1: W = X = Y = Z = A. Then we must have S = U = V = B and T = 0 
(blank), because S ， T ， U，V aren’t tainted. The three left neighbors of V can’t be 
type A, since V already has three A neighbors; nor can they be type B, since V isn’t 
tainted. Hence the tainted X, which must have two B neighbors in the three cells below 
it, cannot also have two or more A neighbors there- 

Case 2: T = A or V = A. If, say, T = A then X = Y = Z = A, and neither V 
nor W can be type B. 


Case 3: S _ A, U = A. Then W can’t be type B, and S must be tainted- 


Case 4: S = A, U _ A. At least one of W, X, Y, Z is B; at least three are A; so 
exactly three are A. The B can’t be Y, which has four A neighbors. Nor can it be W 
or Z: That would force V to be blank, hence T = U = B; consequently W = A, Z = B. 
Since W is tainted, at least two of its right neighbors must be A, contradicting Z = B. 

Thus X = B in Case 4. Either T or V is also B, while the other is blank; say 

T is blank. The three left neighbors of V cannot be A. So they must either all be B 

(tainting the cell left of S) or all blank. In the latter case the upper neighbors of T must 
be BBA in that order, since T is blank. But that taints the B above T. A symmetric 
argument applies if V is blank. 

Case 5: S = U = A. Then W ^ A, and at least two of {X, Y, Z} are A. Now 
Y = Z = A forces T = V = X = B and W blank, tarnishing V. 

Similarly, X = Y = A forces T = W = Z = B and V blank; this case is more 

difficult- The three lower neighbors of Y must be AAB, in that order, lest a B be 

surrounded by four A’s. But then the left neighbors of X are BBB; hence so are the 
left neighbors of V, tarnishing the middle one. 

Finally, therefore, Case 5 implies that X = Z = A. Either T, V, W，or Y is blank; 
the other three are B. The blank can’t be T, since T，s upper three neighbors can’t 
be A* It can’t be W or Y, since V and T aren’t tainted. So T = W = Y = B and V is 
blank. The left neighbors of S cannot be A, because S isn’t tainted. So the cell left of X 
must be A. Therefore X must have at least four A neighbors; but that’s impossible, 
because Y already has three. 
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Diagonally adjacent A’s are rare. (In fact, they cannot occur in rectangular grids 
of size less than 16 x 18.) But diligent readers will be able to spot them in Fig. A— 7, 
which exhibits an astonishing variety of different motifs that are possible in large grids. 

75. Let the cells alive at times p — 2, p — 1， p be of types X ， Y ， Z, and consider the 
topmost row in which a live cell ever appears- Without loss of generality, the leftmost 
cell in that row is type Z. The cell below that Z can’t be of type Y, because that Y 
would have three X neighbors and four Y neighbors besides Z and the blank to its left. 

Thus the picture must look like ;||! ， where the three predecessors of Z and the 
topmost Y are filled in. But there’s no room for the three predecessors of the topmost X. 


Summers 

torus 

Rokicki 

still life 

symeater 

blocks 

carriers 

Buckingham 

Silver 

pixels 

glider’s symmetry 


76. The smallest known example, a 28x33 pattern found 
by Jason Summers in 2012, is illustrated here using the 
letters {F,A, B}, {B,C ， D} ， {D ， E ， F} for cells that are 
alive when t mod 3 = 0, 1 ， 2. His ingenious construction 
leads in particular to an infinite solution based on a 7 x 24 
torus. An amazing infinite 7x7 toroidal pattern also 
exists, but little else is yet known. 




77. If the first four cells in row 4 of Xq (and of X 5 ) contain a, 6, c ， d ， we need a+6 七 1 ， 
a + 6 + c_l ，6 + c + d 七 2. In clause form this becomes 及 V 6 , a V h\! c, cM b\/ c\/ d. 

Similarly, let the last four elements of column 5 be (f,g ， h ， i)' then we want f + 
g + h ★ 2, 分 + /i + i_2, h + i ^2. These conditions simplify to / V / V 歹 V h\/i. 

78. The “9 2 phage” in Fig. A -8 is a minimal example. 

79. (Solution by T. G. Rokicki.) A tremendous battle flares up, raging wildly on all 
fronts. When the dust finally settles at time 1900， 11 gliders are escaping the scene 
(1 going in the original NE direction, 3 going NW，5 going SW, and 2 going SE), leaving 
behind 16 blocks, 1 tub, 2 loaves, 3 boats, 4 ships, 8 beehives, 1 pond, 15 blinkers, and 
1 toad. (One should really watch this with a suitable applet.) 

80. Paydirt is hit on 10 x 10 and 11 x 11 boards, with Xg = Xq ； see Fig. A- 8 . The 
minimal example, “symeaterl9，” has a close relative, “symeater20，” which consists 
simply of two blocks and two carriers, strategically placed. (The first of these, also 
called “eater 2，” was discovered by D. Buckingham in the early 1970s; the other by 
S. Silver in 1998.) They both have the additional ability to eat the glider if it is moved 
one or two cells to the right of the position shown, or one cell to the left. 

It is important to realize that the diagonal track of a glider does not pass through 
the corners of pixels，bisecting them; the axis of a glider’s symmetry actually passes 
through the midpoints of pixel edges, thereby cutting off small triangles whose area is 
1/8 of a full pixel. Consequently, any eater that is symmetric about a diagonal will 
eat gliders in two adjacent tracks- The two in Fig. A -8 are exceptional because they’re 
quadruply effective. Furthermore symeater20 will eat from the opposite direction; and 
either of its carriers can be swapped to another position next to the blocks. 

81. Two eaters make “ssymeaterl4” (Fig. A- 8 ); and “ssymeater22” is narrower. 

82. (a) If X X ’ ，then x\j = 1 only if we have S7^7-1 x i'y ^ ^ - 

(b) Use the same inequality 5 and induction on j. 
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Fig. A— 8 . Various examples of minimal still lifes that eat gliders and spaceships- 
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(c) (Proof of the hint by John Conway ， 1970-) In the transitions 


X 


X 


we must have 囲 in the center of X ’； hence we must have m at the lower left of X. 
But then the center of X, is 昭 . 

83. Work with (2r + 1 — 2t) x (2r + 1 — 2t) grids xtij centered at cell (io, jo )， for 


0 < t < r = /(io, jo); and assume that xtij = 0 whenever > 

if (io, jo) = (1,2 )， only 14 of the xsij can be alive, namely when (i,j) 
(—2. .0,1), (—2 .. 1,0), (—2.. 2, —1). The case (io,jo) = (1,2) leads 
satisfiable clauses on 1316 variables, including the unit clause X 6 i 2 ^ 
transitions are encoded as in answer 65; all but 106 of those variables 


t. For example, 
=(—2.. —1，2)， 
to 5031 readily 
when the state 
are auxiliary* 



84. (a) Use a glider，positioned properly with its tip at (0,0). 

(b) Similarly, a spaceship reaches these cells in the minimum possible time. 

(c) Consider patterns A n = PWW and B n = of width 2n + 1， illustrated 

here for n = 3. Then Bj works when j mod 4 G {1,2}; Aj and Bj 一 i work when 
j mod 4 G {2, 3}; Aj-i works when j mod 4 6 {0, 3}. 

(d) The pattern 固 assembles a suitable glider at time 3- 

(e) A SAT solver found the pattern shown here, which launches an appro¬ 
priate spaceship (plus some construction debris that vanishes at t = 5)- 

[It appears likely that /*(i, j) = for all i and j. But the best general 

result at present，based on space-filling constructions such as Tim Coe’s “Max，” is that 
/*(之 J) = /(^ j) + 0(1)- There’s no known way to prove even the special cases that ， 
say, f* (j, 2j) = 6 j or that 2j) = 3j for all j > 0.] 

85. (a) Let X be a 12 x 12 bitmap. We must show that the clauses T\X^X f ) of 
exercise 65, together with 92 unary clauses ^ 24 , W 25 , … from the given pattern, are 
unsatisfiable. (The pattern is symmetrical; but Life’s rules often produce symmetrical 
states from unsymmetrical ones.) Thus 2 144 — 8 different conceivable predecessor states 
need to be ruled out- Fortunately Algorithm C needs fewer than 100 Mp to do that- 

(b) Most states have thousands of predecessors (see the following exercise); so 
Algorithm C can almost always find one in, say, 500 Kfi. Therefore one can prove, for 
example, that no 6 x 6 Gardens of Eden exist, by rapidly finding a predecessor for each 
of the 2 36 patterns. (Only about 2 36 /8 patterns actually need to be tried, by symmetry*) 
Furthermore, if we run through those patterns in Gray code order, changing the polarity 
of just one assumed unary clause 士 at each step，the mechanism of Algorithm C 
goes even faster, because it tends to find nearby solutions to nearby problems- Thus 
thousands of patterns can be satisfied per second, and the task is feasible- 
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Such an approach is out of the question for 10 x 10 bitmaps, because 2 100 》 2 36 . 
But we can find all 10 x 10 Gardens of Eden for which there is 90°-rotational symmetry, 
by trying only about 2 2b /2 patterns, again using Gray code. Aha: Eight such patterns 
have no predecessor, and four of them correspond to the given orphan. 

[See C. Hartman, M. J. H. Heule, K. Kwekkeboom, and A. Noels, Electronic J. 
Combinatorics 20,3 (August 2013), #P16. The existence of Gardens of Eden with 
respect to many kinds of cellular automata was first proved nonconstructively by E. F. 
Moore ， Proc. Symp. Applied Math. 14 (1962), 17—33.] 

86 . The 80 cells outside the inner 8 x 8 can be chosen in iV = 11 ， 984,516,506,952,898 
ways. (A BDD of size 53464 proves this.) So the answer is _/v/2 100-64 « 174,398. 

87. Instead of using subscripts t and t + 1, we can write the transition clauses for 
X ^ X f in the form (◎ V A 0 V A 0 ’)， etc. Let Alice’s states be {ai ， •… 5 a p } and let 
Bob’s be {fii, …， fi q }. The clauses (@ V V a\) and (◎ V 成 V 戌 ) say that your state 
doesn’t change unless you are bumped. If state a corresponds to the command ‘Maybe 
go to s’，the clause (@ Va Va 7 V s) defines the next possible states after bumping. The 
analogous clause for ‘Critical, go to 5 ’ or ‘Set v <— go to is simply (◎ V a V s’）; 
and the latter also generates the clause (@ V a V v r ) if 6 = 1， (@ V a V v) if 6 = 0. The 
command c If v go to si, else to so’ generates a\/ v W s[) A aW v\/ s f 0 ). And 
for each variable if the states whose commands set v are ai 1 , … ， ai h ^ the clauses 

(@\/ v\/ a h \/ - -W a ih \/ v) A (@\/ v W a h W - • W a ih W v) 

encode the fact that v isn’t changed by other commands. 

Bob’s program generates similar clauses —— but they use ◎，not ◎, and not a. 
Incidentally, when other protocols are considered in place of ( 40 ), the initial 
state Xo analogous to ( 41 ) is constructed by putting Alice and Bob into their smallest 
possible states, and by setting all shared variables to 0 - 

88 . For example, let all variables be false except A0o ， B0o ， ◎() ， All, B0i ， Al‘ 2 , Bl‘ 2 , 

AI3, B23, @3, A24, B24, @4, A35, B25, /s, A 3 e, B 3 e, < 6 . 

89. No; we can find a counterexample to the corresponding clauses as in the previous 
exercise: A0o, B0o, A0i ， Bli, AO 2 , B2 2 , 62 , @ 2 , Al 3 , B2 3 , 63 , Al 、 B 34 , 64 , Al 5 ， B4 5 , 
65 , @ 5 , A 2 e ， B46, a 6 , & 6 , @ 6 , A 57 , B 47 , a 7 , 67 , A5s ， B 2 g ， as ， 6 s ， , 8 , A5g, B 59 , ⑽， 69 , , 9 . 

(This protocol was the author’s original introduction to the fascinating problem of 
mutual exclusion [see CACM 9 (1966) ， 321-322 ， 878], about which Dijkstra had said 
“Quite a collection of trial solutions have been shown to be incorrect ?’） 

90. Alice starves in ( 43 ) with p = 1 and r = 3 in ( 47 ), if she moves to A 1 and then 
Bob remains in B0 whenever he is bumped. The A2 A B2 deadlock mentioned in the 
text for ( 45 ) corresponds to ( 47 ) with p = 4 and r = 6 . And in ( 46 )，successive moves 
to Bl, (B2, Al, A2, B3, Bl, A4, A5, A0)°° will starve poor Bob. 

91. A cycle ( 47 ) with no maybe/critical states for Alice can certainly starve her. 
Conversely, given (i), (ii )， (iii)，suppose Alice is in no maybe/critical state when t > to] 
and let to < ti < t 2 < • • • be times with ◎“ = 1 but with @ t = 0 for at least one t 
between ti and ^+ 1 . Then we must have Xt { = Xt- for some i < j, because the number 
of states is finite. Hence there’s a starvation cycle with p = ti and r = tj. 

92. For 0 < i < j < r we want clauses that encode the condition Xi ^ Xj • Introduce 
new variables aij for each state a of Alice or Bob, and Vij for each shared variable v. 
Assert that at least one of these new variables is true. (For the protocol ( 40 ) this clause 


90°-rotational symmetry 

Hartman 
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nonconstructively 
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transition clauses 
initial state 
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would be (AOij V • • • V A4y V BOij V • • • VB4y V Uj)-) Also assert the binary clauses (&ij V 
Ci)A (&ij \JGj) for each a，and the ternary clauses (vij WviWvj) A (vij V Vi V Vj ) for each v. 

The transition clauses can also be streamlined, because we needn’t allow cases 
where Xt+i = Xt. Thus, for example, we can omit B0^+i from the clause (◎，V B07 V 
BO^+i V Blf+i) of ( 42 ); and we can omit the clause (◎，V BT7 V ^ V Bl^+i) entirely. 

[If r is large, encodings with 0(r (log r) 2 ) clauses are possible via sorting networks, 
as suggested by D, Kroening and O. Strichman, LNCS 2575 (2003) ， 298-309. The 
most practical scheme, however, seems to be to add the ij constraints one by one 
as needed; see N, Een and N* Sorensson, Electronic Notes in Theoretical Computer 
Science 89 (2003), 543-560.] 

93. For the $ in ( 50 )，for example, we can use (xi V X 2 V • - V xi6) A (xi V A0’) A • • • A 
(xiVA6 r )A(x2VW)A--A(x2VW)A(x3VA0 / )A(x 3 Va / )A--A(xi 6 VB6 / )A(x 1 6Vb / ). 

94. (X X f ^ ^ X (r) ) A $(X) A A •• • A $(X (r_1) ) A -$(X (r) ). [This 

important technique is called “^-induction” ； see Mary Sheeran, Satnam Singh, and 
Gunnar Stalmarck, LNCS 1954 (2000) ， 108-125 - One can, for example, add the clause 
(A5 V B5) to ( 50 ) and prove the resulting formula $ by 3-induction.] 

95. The critical steps have a = 6 = 1, by the invariants, so they have no predecessor* 

96. The only predecessor of A 5‘2 八 B 5*2 A a *2 A 62 A I 2 is A5i A B4i Aai A 61 A [i ； and the 
only predecessor of that is A5o A B3o A ao A 60 A lo. The case 1 2 is similar. 

But without the invariants，we could find arbitrarily long paths to A5 r A B5 r . In 
fact the longest such simple path has r = 33: Starting with A2o A B2o 八 ao 八 60 八 Zo, we 
could successively bump Alice and Bob into states A3, A5, A 6 , A0, Al, A2, A3, B3, 
B4, A5, B3, A 6 , B4, A0, B3, Al, A2, A3, A5, A 6 , A0, Al, A2, B4, A3, A5, A 6 , A0, 
B5, Al, A2, A3, A5, never repeating a previous state- (Of course all of these states are 
unreachable from the real Xo^ because none of them satisfy $•) 

97. No. Removing each person’s final step in a path to A 6 AB 6 gives a path to A5 AB5- 

98. (a) Suppose Xo 4 4 r = 為 is impure and Xi = Xj for some 0 < i < j < r- 

We may assume that i = 0. If either of the two cycles Xo ^ Xj = Xo or 

Xj 4 … 4 X r = Xj is impure, it is shorter. 

(b) In those states she would have had to be previously in A0 or A5. 

(c) Generate clauses (g 0 ), (gt V g t ^i V (h 0 ), (h t V h t -i V 斷一丄 ) ， （ f t V g t ), 

(J t W ht) ， （ft V ao V at), (ft V ao V at), (ft V t；o V t ； t), (ft W vo\/ vt)^ for 1 < f < r; and 
(/i V /2 V … V / r ). Here v runs through all shared variables 5 and a runs through all 
states that can occur in a starvation cycle. (For example, Alice’s states with respect 
to protocol ( 49 ) would be restricted to A3 and A4, but Bob’s are unrestricted.) 

(d) With exercise 92 we can determine that the longest simple path, using only 
states that can occur in a starvation cycle for ( 49 )，is 15 - And the clauses of (c) are 
unsatisfiable when r = 15 and invariant ( 50 ) is used- Thus the only possible starvation 
cycle is made from two simple pure cycles; and those are easy to rule out¬ 
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Kroening 
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99. Invariant assertions define the values of a and b at each state. Hence mutual 
exclusion follows as in exercise 95 - For starvation-freedom, we can exclude states A0, 
A6, A7, A8 from any cycle that starves Alice. But we need also to show that the state 
A5t A B0t A It is impossible; otherwise she could starve while Bob is maybe-ing. For 
that purpose we can add -i((A6 V A7 V A8) A (B6 V B7 V B8)) 八 -n(A8 Al) A -i(B8 A /) A 
i((A3 V A4 V A5) A B0 A /) A ， (A0 八 （ B3 V B4 V B5) A l) to the invariant $(X). The 
longest simple path through allowable states has length 42; and the clauses of exercise 
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98(c) are unsatisfiable when r = 42. Notice that Alice and Bob never compete when 
setting the common variable because states A7 and B7 cannot occur together. 

(See Dijkstra’s Cooperating Sequential Processes, cited in the text-) 

100. Bob is starved by the moves Bl ， (Al ， A2 ， A3 ， B2 ， A4 ， B3 ， AO ， B4 ， Bl)°°. But 
an argument similar to the previous answer shows that Alice cannot be. 

[The protocol obviously provides mutual exclusion as in exercise 95 - It was devised 
independently in the late 1970s by J. E. Burns and L. Lamport, as a special case of an 
A^-player protocol using only N shared bits; see JACM 33 (1986) ， 337-339.] 

101. The following solution is based on G. L. Peterson’s elegant protocol for N pro¬ 
cesses in ACM Transactions on Programming Languages and Systems 5 (1983) ， 56-65: 


A0. Maybe go to Al. 

Al. Set ai [ 1, go to A2* 

A2. If 62 go to A2, else to A3. 
A3. Set a ‘2 1， go to A4. 

A4. Set ai 0, go to A5. 

A5. If 61 go to A5, else to A 6 . 
A 6 . Set ai 卜 1, go to A7- 
A7. If 61 go to A 8 , else to A9. 
A 8 . If 62 go to A7, else to A9. 
A9. Critical, go to A10- 
A10- Set ai 0, go to All. 

All. Set a *2 0, go to A0. 

(Alice and Bob might need an 
app to help them deal with this.) 


B0. Maybe go to Bl. 

Bl. Set 61 1， go to B2* 

B2. If ai go to B2, else to B3. 
B3. Set 62 1, go to B4. 

B4. Set 61 0, go to B5- 

B5 - If a *2 go to B5, else to B 6 . 
B 6 . Set bi 1, go to B7. 

B7. If ai go to B 8 , else to B12. 
B 8 . If a *2 go to B9, else to B12. 
B9. Set 61 0, go to BIO- 

B10. If ai go to Bll, else to B 6 . 
B1L If a ‘2 go to B10, else to B 6 . 
B12. Critical，go to B13. 

B13 - Set bi 0, go to B14. 

B14. Set 62 0, go to B0. 


102. The clauses for, say ， ‘B5. If a go to B 6 , else to B7should be (◎ V B5 V a V ai V 
• • • V V B 6 ’ ） A(@VB5VaVaiV---Va p V B7’）A (@ V B5 V B 6 7 V B7 ’)， where ai ， 
…， are the states in which Alice sets a. 

103. See, for example, any front cover of SICOMP^ or of SIAM Review since 1970. 


104. Assume that m < n. The case m = n is clearly impossible, because all four 
corners must be occupied. When rrt is odd and n = m + A: + 1 ， put m bishops in the 
first and last columns, then k in the middle columns of the middle row. When m is even 
and n = m + 2 A: + 1 ， put m in the first and last columns, and two in the middle rows 
of columns m/2 + 2j for 1 < j < k. There’s no solution when m and n are both even ， 
because the maximum number of independent bishops of each color is (m + n — 2)/2. 

[R. Berghammer, LNCS 6663 (2011), 103-106.] 

105. (a) We must have = (1 ， 0) for t pairs ij^ and (0,1) for t other pairs; 

otherwise = x\j . Hence there are 2 mn _ 2t solutions. 


(b) Use 2mn variables , y[j for 1 < i < m and 1 < j < n, with binary clauses 
(yij V y[j) 5 together with m + n + 2(m + n — 1) sets of cardinality constraints such as 
( 20 ) and ( 2 i) to enforce the balance condition ^2{yij +Vij | ij G L} = \L\ for each row, 
column, and diagonal line L. 

(c) T(m ， n) = 1 when min(m ， n) < 4 ， 
because only the zero matrix qualifies in 
such cases. Other values can be enumerated 
by backtracking, if they are small enough. 

(The asymptotic behavior is unknown.) 

(d) Suppose m < n. Any solution with 
nonzero top row，bottom row, left column, and right column has all entries zero except 


n = 

4 

5 

6 

7 

8 

T(4, n)= 

3 

7 

17 

35 

77 

T(5,n) = 

7 

31 

109 

365 

1367 

T( 6 ,n) = 

17 

109 

877 

6315 

47607 

T(7,n) = 

35 

365 

6315 

107637 

1703883 

r( 8 ,n) = 

77 1367 47607 1703883 66291089 
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that yit = — ytl = y(m+l — t)l = — Umt = ym(n+l — t) = — — = Utn = — ?/l(n+l — 0 , 

for some t with 1 < t < m/2. So the answer is 2 \_(k — 1)/2 」 （m — k){n — k), which 

simplifies to —l)(4g(n —g) — 5n+2g+3+(mmod 2)(6n — 8 g —5))/3 when q = [m/2\. 

[The answer in the case (m ， n) = (25, 30) is 36080; hence a random 25 x 30 image 
will have an average of 36080/256 ^ 140.9 tomographically equivalent “neighbors” that 
differ from it in exactly eight pixel positions- Figure 36 has five such neighbors, one of 
which is shown in answer 111 below-] 

(e) We can make all entries nonzero except on the main diagonals (see below) - 
This is optimum，because the diagonal lines for ai, a 3 , •… ， a 4 n-i, in, 63 , ••” & 4 n-i 
must each contain a different 0. So the answer is 2n(n — 1)- (But the maximum for 
odd sized boards is unknown; for n = (5, 7, 9) it turns out to be (6, 18, 33).) 


Gerdes 
unit clauses 


0+++ — 0 
- 0++ —— 0 + 
—— 0 +- 0 ++ 

- 00 +++ 

+++00 — 
++ 0 -+ 0 —— 

+ 0 ——++ 0 - 

0+++0 - 0 

-+++ 0 + 

0++——00 

0 ++ ——00 

0 —+—+——++ 

——++000 

——++000 

++-++ 0 — 

0 + ——00 + 

0 +- 0 - 0 + 

-+- +—— 0 ++ 

一 00 + 0 —+ 

一 0 —+ 00 + 

+ 0 - 0 + 0 + —— 

+ 0 - 0 + 0 - 

+ 000 +—— 

——+——+++0 

+ 0 + 0 - 0 - 

+ 0 +- 00 - 

++++— 0 — 
0-0 一一 +++0 

0 —— 0++0 

0 —— 0++0 

0 - h ++0 


(f) The smallest counterexamples are 7x7 (see above). 

106. In an m x n problem we must have 0 < ri < 0 < Cj < and 0 < ad^bd < 

min{d ， m, n, m + n — d}. So the total number B of possibilities，assuming that m < n, is 
(n + l) m (m + l) n ((m +1)! (m + l) n_m m!) 2 , which is « 3• 10 197 when (m, n) = (25,30). 
Since 2 7 ^°/B » 2 - 10 28 ， we conclude that a “random” 25 x 30 digital tomography 
problem usually has more than 10 28 solutions. (Of course there are other constraints 
too; for example, the fact that ^ Vi = Y1 c j = J2 a d reduces B by at least a 

factor of (n + l)(m + l) 2 .) 

107. (a) (n,...,r 6 ) = (11,11,11,9,9,10); (ci,... ,ci 3 ) = ( 6 ,5, 6 ,2,4,4, 6 ,5,4,2, 6 ,5, 6 ); 
(ai,...,a 6 ) = (11,10,9,9,11,11); ( 6 i,... , 612 ) = ( 6 ,1, 6 ,5,7,5, 6 ,2, 6 ,5,7,5). 

(b) There are two others，namely the following one and its left-right reversal: 

MM MM MM MM … • 


[Reference: P. Gerdes, Sipatsi (Maputo: U. Pedagogica, 2009)，page 62, pattern #122.] 

108. Here are four of the many possibilities: 



109. FI. [Initialize.] Find one solution yi .. .y n ^ or terminate if the problem is unsat - 

isfiable. Then set y n +i 1 and d *<— 0. 

F2. [Advance d.] Set d to the smallest j > d such that yj = 1 - 

F3. [Done?] If d > n, terminate with yi ••• y n as the answer. 

F4. [Try for smaller.] Try to find a solution with additional unit clauses to force 
Xj = yj for 1 < j < d and Xd = 0. If successful, set yi … y n xi... x n . 
Return to F2* | 

Even better is to incorporate a similar procedure into the solver itself; see exercise 275- 
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110. Algorithm B actually gives these directly: 

001111111011101111100101111101111011111110111011011111111100101111101111011111100111011111110111 

111111111011111111001100111111001111011111111010111111110111101111111001100111110110111101111111 

111. This family of problems appears to provide an excellent (though sometimes formi¬ 
dable) series of benchmark tests for SAT solvers- The suggested example has solutions 



(a) colexicographically first; (b) minimally different; (c) colexicographically last; 

and several of the entries in (a) were by no means easy- An even more difficult case 
arises if we base lexicographic order on a rook path that spirals out from the center 
(thus favoring solutions that are mostly 0 or mostly 1 in the middle): 



(a) spiral rook path; (b) “spirographically” first; (c) “spirographically” last. 


benchmark tests 
rook path 
spiral 

cutting plane 
Balas 
Fischetti 
Zanette 
Brunetti 
Del Lungo 
Gritzmann 
de Vries 


Here many of the entries have never yet been solved by a SAT solver, as of 2013, although 
again IP solvers have no great difficulty. In fact, the “lexicographic pure cutting plane” 
procedure of E. Balas, M. Fischetti，and A. Zanette [Math. Programming A130 (2011 )， 
153-176; A135 (2012), 509-514] turns out to be particularly effective on such problems- 

112. Reasonably tight upper and lower bounds would also be interesting. 

113. Given sm N x N x N contingency problem with binary constraints Cjk = X*jk, 

Rik = Xu, Pij = Xij” we can construct an equivalent n x n digital tomography 
problem with n = N 2 + N 3 + N 4 as follows: First construct a four-dimensional tensor 
Yijkl = 叉 (/©LyK，where / ㊉ L = l + (/ + L — 1) mod N, and notice that Y^jkl = 
Yijk^ = Yi^kl = X(j©l)*k ， Yij^l = 叉 Then define Xij for 1 < 

ij < n by the rule Xij = Yijkl when i = I - N 2 K + N S L, j = NJ + N 2 K + N 3 L, 
otherwise Xij = 0. This rule makes sense; for if 1 < 1,1’ ， J, J r ,K,K r ,L,L! S N and 
I-N 2 K + N 3 L = I f -N 2 K f ^N 3 L f and NJ + N 2 K + N 3 L = Nf +N 2 K f 

we have I 三 I’ (modulo iV); hence 1 = 1’ and K 三 K f •, hence K = K 、 L = L!, J = J f . 

Under this correspondence the marginal sums are ri = Yi^kl when i = I—N 2 K+ 
N S L, Cj = Y.jkl when j = NJ~\~N‘ 2 K~\~N 3 L,a d = Yij.l when d+l = I-^NJ-^2N S L, 
bd = Yijk^ when d — n = I — NJ — 2N 2 K, otherwise zero. [See S- Brunetti ， A. Del 
Lungo, P- Gritzmann, and S. de Vries, Theoretical Comp. Sci. 406 (2008), 63-71.] 
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114. (a) From ^ 7,23 + $7,24 = ^ 7,23 + 尤 7,24 + X 7/25 = 2 : 7,24 + X 7/25 = 1 we deduce 
3 : 7,23 = X 7/25 = 0 and X7，‘24 = 1， revealing 717,23 = ri 7，‘25 = 5. Now X 6/23 + 工 6,24 = 
$6,24 + $6,25 = 尤 4,24 + $5,24 + 尤 6,24 + x 6 /25 = 1 ； hence ^4,24 = $ 5,24 = 0 ， revealing 
714,24 = ri 5,24 = 2. So X6/23 = 尤 6,25 = 0, and the rest is easy. 

(b) Let yij mean “cell (i,j) has been probed safely, revealing n。.” Consider 
the clauses C obtained by appending yij to each clause of the symmetric function 

X i / J , = 叫，儿 for all i，j with Xij = 0- Also include (xijWyij), as well 
as clauses for the symmetric function Sn(x) if we’re told the total number N of mines. 

Given any subset F of mine-free cells，the clauses Cf = C A /\{yij | (i, j) G F} 
are satisfiable precisely by the configurations of mines that are consistent with the data 
{riij I (i^j) G F}. Therefore cell (i ， j) is safe if and only if Cf A Xij is unsatisfiable- 
A simple modification of Algorithm C can be used to “grow” F until no further 
safe cells can be added: Given a solution to Cf for which neither Xij nor Xij was 
obtained at root level (level 0 )，we can try to find a “flipped” solution by using the 
complemented value as the decision at level 1- Such a solution will be found if and only 
if the flipped value is consistent; otherwise the unflipped value will have been forced at 
level 0. By changing default polarities we can favor solutions that flip many variables 
at once. Whenever a literal Xij is newly deduced at root level，we can force yij to be 
true，thus adding (i,j) to F. We reach an impasse when a set of solutions has been 
obtained for Cf that covers both settings of every unforced Xij. 

For problem (i) we start with F = {(1 5 1)} ， etc. Case (iv) by itself uncovers only 
56 cells in the lower right corner. The other results, each obtained in < 6 G// ， are: 


W ， ⑹ 

88 勝礙 ? 88888888888 ? 礙 §?8888 

QQ13XRXX3101233333212X44X2OQQQ 
QQ2X434XX433XXXXXXX35X32X3lQQQ 
003X5X34XXXX3333333XXX424X2000 

隱髒鑼画願 I 漏 

QQ2X310QQQQQQQQQQQQQ12X23X31QQ 
QQ12X210QQQ0QQQQQ00QQ112X4X2QQ 
00013X21232100000123210114X300 

QQ003 x 42xxx2iQQQi2xxx2 1003 x 300 

22113X4X6S5X2QQQ2X6I5X2QQ2X31Q 
XX34X34XXI5X30002XX■5X20013X20 
23XXX43XXXXX312123XXX2lQQO3X3Q 
245 XX2233323X4X2 12321QQQ1 4X52 
XXXB653100013X5X3000012333XXXX 
■■■XXXX1QQQ1X45X2QQQQ1XXXX5 □□匚 
mmi 42222112XX21QQQ135■■■■■ 

211XX43443100001XX 
2X334XXXXX433333444 
■■■Xn4HHXXXXXXXXX2 


(iii) 

888^J?888888888889^198888 

P 额雜题綴缀遝趨激 ■ 

003X5X34XXXX3333333XXX424X2000 

麟_ 麵腿議 麗麵 

XX34X34XX !§><30082><^15><2001^20 

8賴叕飪?艱叕祜挪游叕腰闢圜 

8888 _f: 篇 ■■■■■ 


(V) 

888^9888888888889^198888 

QQ13XRXX3 1012333332 12X44X20QQQ 
QQ2X434XX433XXXXXXX35X32X31QQQ 
003X5X34XXXX3333333XXX42 4X2 000 

麟 I 離綱_麟灑 

QQ2X310QQQQQQQQQQQQQ12X23X31QQ 
QQ12X210QQQ0QQQQQ0QQ0112X4X2QQ 
000 13X212321000001232 101 14X300 
QQ003 x 42XXX21QQQ12XXX21QQ3X30Q 
22113X4X6■5X2QQQ2X6I5X2QQ2X31Q 
XX34X34XXIbX30002XXI5X20013X20 
23XXX43XXXXX312123XXX21QQ03X30 
245 6XX2233323X4X21232100014X52 
XXXX65 310001 3X5X3000012333XXXX 
235XXXX1QQQ1X45X2QQQ01XXXX5X 
0Q2X542222112XX21QQQ1356656X 
001 2X211XX43443 100001 XXXXXXXXX 
QQQ12X334XXXXX433333444345X432 
QQQQ12XX445 E XXXXXXXXX223X21QQ 
00000123XX3 ■ 鱺騙 _5554XX21O00 

QQQQQQQ123 5XXX321QQQQ 

000000000!■■■■■■■ XXX421000000 


Notice that the Cheshire cat’s famous smile defies logic and requires much guesswork! 

[For aspects of Minesweeper that are NP-complete and coNP-complete，see Kaye ， 
Scott ， Stege，and van Rooij ， Math. Intelligencer 22, 2 (2000) ， 9-15; 33, 4 (2011), 5-17.] 

115. Several thousand runs of the algorithm in the previous exercise, given that the 
total number of mines is 10, indicate success probabilities .490 士 .007, .414 ± .004, 
.279 ± .003, when the first guess is respectively in a corner, in the center of an edge, or 
in the center. 


116. The smallest is the “dock” in answer 69(b)- Other noteworthy possibilities are 
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■■■ 
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■■■ 

4 



■■国 


as well as the “phoenix” in Fig. A—7. 
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117* (a) Set xo = x n +i = 0, and let (a, 6 , c) be respectively the number of occurrences 
of (01, 10, 11) as a substring of xoxi ... x n -\-i. Then a + c = b + c = vx and c = x\ 
hence a = b = vx — x is the number of runs- 

(b) In this case the complete binary tree will have only n — 1 leaves，corresponding 
to {xiX 2 ^. - - , x n ^ix n }] therefore we want to replace n by n — 1 in ( 20 ) and ( 21 ). 

The clauses of ( 20 ) remain unchanged unless U < 3. When U = 2 they become 
(^ 2 /c-n+l V ^ 2 /c-n +2 V 61 ) A(^ 2 /c - n +2 V^2/c-n+3 V 61 ) A^X 2 /c-n+l V ^ 2 /c-n +2 WX2k-_ n +S^bl). 
When “ = 3 we have 2k = n — 1, and they become (bf k V 61 ) A (xiV X 2 V bi) A (b 2 k V b^) A 
(b\ k W xi Vx 2 V 6 §) A (bl k V xi V x 2 V b!). 

The clauses of ( 21 ) remain unchanged except in simple cases when n < 3. 

(c) Now the leaves represent xJxi+T = X{ M Xi^i. So we change ( 20 )，when tk = 2, 
to (x 2 /c-n+l V bi) A (x 2 k-n +2 V 61 ) A (x 2 k-n+3 V bi) A (x_ 2 k -n +2 V 62 ) A (x 2 k-n+l V 
X 2 k-n -\-3 V 62 ) - And there are eight clauses when tk = 3: (bl k V ) A (xi V bi) A (x 2 V 
bi) A (£f V A ( 6 'f V xi V b k 2 ) A ( 6 f V V b k 2 ) A W xi W b^) A (b 2 2 k W x 2 W b^). 

118. Let pij = [the pixel in row i and column j should be covered], and introduce 
variables hij when pij = pij+i = 1, Vij when pij = pi+ij = 1- The clauses are 

(i) (hij V hij-i V Vij V Vi 一 i ， j), whenever pij = 1, omitting variables that don’t exist; 

(ii) {hijWhij-i)^ (hij Vt；i j), (hij\/vi-ij)^ (hij-iyvij)^ (、，j —i V 氏一 i ， j), (vij \/vi~ij)^ 
whenever pij = 1, omitting clauses whose variables don’t both exist; and (iii) (hij V 
hi + ij\/vij\/vij + i)^ whenever pij+pij+i+pi+ij+pi+ij+i > 3, omitting variables that 
don’t exist. (The example has 10527 clauses in 2874 variables，but it’s quickly solved-) 

119. There’s symmetry between l and 『， also between l and 10 — l] so we need consider 
only l = (1 ， 2, 3,4, 5)，with respectively (4, 4, 6 , 6 , 8 ) occurrences. The smallest result is 
F_\ 5 = {123,_234,_678,_789,_246, 468, 147, 369, 123, 234, 34, 46, 67, 678, 789, 13, 246, 
37, 468, 79, 147, 28, 369, 19}. 

120. True. 

121 • The main point of interest is that an empty clause is typically discovered in the 
midst of step A3; partial backtracking must be done when taking back the changes 
that were made before this interruption. 

A3. [Remove L] Set p ^ F(l) (which is FU ㊉ 1)，see ( 57 )). While p > 2n + 2, 
set j <— C (p), i 4 — SIZE (j) ， and if i > 1 set SIZE (j) ^ i — 1^ p ^ F (p). But 
if S = 1， interrupt that loop and set p B(p); then while p > 2n + 2, set 
j C(p), i SIZE(j) ， SIZE(j) i + 1 5 p 4 — B(p); and finally go to A5. 

A4. [Deactivate Vs clauses.] Set p l F ⑴ • While p > 2n + 2， set j l C(p), 
i <r- START (j) ^ p F(p)，and for i < 5 < i + SIZE(j) — 1 set q F(s )， 
r B(s) ， B(g) l r ， F(r) g，and C(L(s)) C(L(s)) — 1. Then set 

a I a — C (D ， d l d + 1， and return to A2. 

AT. [Reactivate Vs clauses.] Set a 卜 a + C(D and p B(/)• While p > 2n + 2, 
set j <— C (p) ， i <— START (j), p *<— B(p) ， and for i < s < i + SIZE (j) — 1 set 
q F(s)，r —■ B(s) ， B(g) —• F(r) I s，and C(L(s)) 1 C(L(s)) + 1. (The 
links dance a little here.) 

A 8 . [Unremove L] Set p ^ F(l). While p > 2n + 2, set j <— C(p), i SIZE(j ) ， 
SIZE(j) i + 1， p F (p) • Then go to A5 - | 

122. Pure literals are problematic when we want all solutions, so we don’t take advan¬ 
tage of them here- Indeed, things get simpler; only the move codes 1 and 2 are needed. 

Al*. [Initialize.] Set d •<— 1. 
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A2' [Visit or choose.] If d > n, visit the solution defined by mi ... m n and go to 
A6*- Otherwise set / ^ 2d + 1 and rrid ^ 1. 

A3*. [Remove l.] Delete l from all active clauses; but go to A5* if that would 
make a clause empty. 

A4' [Deactivate Vs clauses.] Suppress all clauses that contain 1. Then set d ^ 
d + 1 and return to A2' 

A5*. [Try again.] If = 1， set i 2, Z —• 2d，and go to A3' 

A6*. [Backtrack.] Terminate if d = 1. Otherwise set d i- d — 1 and l ^ 2d + 
{md & 1)- 

A7*. [Reactivate Vs clauses-] Unsuppress all clauses that contain 1. 

A8' [Unremove [.] Reinstate l in all the active clauses that contain it- Then go 
back to A5*. | 

It’s no longer necessary to update the values C(k) for A: < 2n + 2 in steps A4* and A7' 

123. For example, we might have 

012345678 9 10 11 12 13 14 15 16 17 18 19 20 
L(p) = 397875653438286964742 


and START (j) = 21 - 3j for 0 < j < 7] W 2 = 3, W 3 = 7, W 4 = 4, = 0, = 5, 

W 7 = 1, Wg = 6 , Wq = 2. Also LINK (j) = 0 for 1 < j < 7 in this case. 

124. Set j 卜 Wf. While j — 0, a literal other than l should be watched in clause so 

we do the following: Set i START (j ) ， i’ l START(j — 1), j 4 — LINK(j), k ^ i + 1. 
While k < set l f L(k); if l f isn’t false (that is, if |Z’| > d or I’ + m\i>\ is even, 
see ( 57 ))，set L(i) L(A0 l 『， LINK(j) ^ j, j ^ and exit the loop 

on k] otherwise set A: 4- A: + 1 and continue that loop. If k reaches however, we 
cannot stop watching T; so we set W\ <— exit the loop on j，and go on to step B5- 

125. Change steps B2 and B4 to be like A2* and A4* in answer 122. 

126. Starting with active ring (6978)，the unit clause 9 will be found (because 9 
appears before 8 ); the clause 936 will become 639; the active ring will become (786)- 

127. Before: 11414545; after: 1142. (And then 11425 ， etc.) 


128. Active ring 

(1234) 

(2 3 4) 

(3 4) 

⑷ 

(3 4) 

(3 4) 

(3) 

(4 3) 

(2 4 3) 

(4 3) 

⑷ 

(3 4) 

(3 4) 

(3) 


0 --- 
o 0 - - 
000- 
0 --- 
o 1 - - 
01-0 



10 —— 
10-1 


Units Choice Changed clauses 

1 413 

2 124 

3 3 

4,4 Backtrack 

2 321,421 

4 4 314, 342 

3. 3 Backtrack 

1 214,413 

2 

3 3 432,231 

4.4 Backtrack 

2 321,412 

4 4 314,124,342 

3, 3 Backtrack 
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129. Set j Wi^ then do the following steps while j ^ 0: (i) Set p ^ START (j) + 1; 
(ii) if p = START (j — 1)，return 1; (iii) if L(p) is false (that is，if x^( p )\ = L(p) & 1 )， 
set p 卜 p + 1 and repeat (ii); (iv) set j 卜 LINK(j). If j becomes zero, return 0. 

130. Set l 2k + j ^ W\ 1 0, and do the following steps while j ^ 0: (i) Set 

f l LINK(j)，i i START(j )， p ^ i + (ii) while L(p) is false, set p ^ p + 1 (see 
answer 129; this loop will end before p = START (j — 1)); (iii) set Z’ l L(p) ， L(p) 4 - 
L(i) l f ] (iv) set p W[i and q Wp, and go to (vi) if p ^ 0 or q ^ 0 or x\if\ > 0; 
(v) if 尤 = 0， set t ^ h i— \l f \ and NEXT ⑴ ' /i，otherwise set NEXT (|^|) ^ h \l r \^ 
NEXT ⑴ h (thus inserting |Z’| = 厂 》 1 into the ring as its new head); (vi) set 
LINK(j) 卜 p ， W// j (thus inserting j into the watch list of l’)' (vii) set j j . 

[The tricky part here is to remember that t can be zero in step (v).] 

131. For example, the author tried selecting a variable Xk for which s‘ 2 k • s‘ 2 /c+i is 
maximum, where si is the length of Vs watch list plus e，and the parameter e was 0.1. 
This reduced the runtime for waerden(3^ 10; 97) to 139.8 gigamems，with 8.6 mega- 
no des* Less dramatic effects occurred with langford{!?>): 56-2 gigamems，with 10.8 
meganodes, versus 99.0 gigamems if the minimum S 2 k • & 2 /c+i was chosen instead. 

132. The unsatisfiable clauses (无 iVa> 2 )，(xiV 无 2 ) ， {x^xa)^ (T 3 V 元 4 ) ， ••• ， (^ 2 n-i Vx 2 n), 
(x 2 n-i V 无 2 n) ， (^ 2 n-i V 无 2 n) ， (^ 2 n-i V X 2 n) cause it to investigate all 2 n settings of xi ， 
工 3 ， … ， X2n-i before encountering a contradiction and repeatedly backtracking. 

(Incidentally, the successive move codes make a pretty pattern. If the stated 
clauses are ordered randomly, the algorithm runs significantly faster，but it still appar¬ 
ently needs nonpolynomial time. What is the growth rate?) 

133. (a) Optimum backtrack trees for n-variable SAT problems can be calculated with 
0(n3 n ) time and ©(3 n ) space by considering all 3 n partial assignments，“bottom up.” 
In this 9-variable problem we obtain a tree with 67 nodes (the minimum) if we branch 
first on X 3 and X 5 , then on xq if X 3 ♦ unit clauses arise at all other nodes. 

(b) Similarly, the worst tree turns out to have 471 nodes- But if we require the 
algorithm to branch on a unit clause whenever possible, the worst size is 187. (Branch 
first on xi，then X 4 ，then X 7 ] avoid opportunities for unit clauses-) 

134. Let each BIMP list be accessed by ADDR ， BSIZE ， CAP, and K fields, where ADDR is the 
starting address in MEM of a block that’s able to store CAP items, and CAP = 2 K ; ADDR is 
a multiple of CAP, and BSIZE is the number of items currently in use. Initially CAP = 4, 
K = 2, BSIZE = 0, and ADDR is a convenient multiple of 4. The 2n BIMP tables therefore 
occupy 8n slots initially. If MEM has room for 2 M items，those tables can be allocated 
so that the doubly linked lists AVAIL [A:] initially contain ak = (0 or 1) available blocks 
of size 2 k for each where 2 M — 8n = (ajw-i … aiao) 2 - 

Resizing is necessary when BSIZE = CAP and we need to increase BSIZE. Set 
a ^ ADDR, A: i K ， CAP i- 2&+ 1 , and let 6 l a ㊉ 2^ be the address of a’s buddy. If b is 
a free block of size 2 k 、 we’re in luck: We remove b from AVAIL [A:]; then if a & 2 k = 0, 
nothing needs to be done, otherwise we copy BSIZE items from a to 6 and set ADDR b. 

In the unlucky case when b is either reserved or free of size < 2 k \ we set p to the 
address of the first block in AVAIL [A/] ， where AVAIL [t] is empty for k < t < k f (or 
we panic if MEM’s capacity is exceeded) - After removing p from AVAILED’] ， we split off 
new free blocks of sizes 2 &+ 1 ， …，’一 1 if A:’ > A: + 1. Finally we copy BSIZE items 
from block a to block p, set ADDR l p, and put a into AVAIL [A:]. (We needn’t try to 
“collapse” a with its buddy, since the buddy isn’t free.) 

135. They’re the complements of the literals in BIMP(0. 
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136. Before, {(1 ， 2) ， (4, 2), (4, 5) ， (5,1) ， (5, 7), ( 6 , 9)}; after, {(1 ， 2) ， (4,2) ， ( 6 , 9)}. 

137. If p in a TIMP list points to the pair (u^v)^ let’s write u = U(p) and v = Yip). 

(a) Set N ^ n - G, x ^ VAR [TV], j ^ INX [X] , VAR[j] < - INXM ^ j, 
VAR[iV] •<— X, INX[X] •<— N. Then do the following for l = 2X and l = 2X -1- 1, 
and for all p in TIMP(/): u ^ U(p), i; < - Yip), p ^ LINK(p), p n ^ LINK(p )； 
s •<— TSIZE(tZ) —1 ， TSIZE ( 丑 ) —s, i — pair s of TIMPC^x); if p’ 士 t, swap pairs by setting 
u ^ U(0 5 v ^ V(0, q < - LINK ⑴， — LINK(g), LINK(^) ^ p\ LINK(p) 4 - t, 
U(p) ^ u\ V(p) v , LINK(p) q, U(0 ^ v, V ⑴“， LINK ⑴ l 〆’• Then set 
s •<— TSIZE ⑹ —1 ， TSIZE ⑹ —s, i *<— pair s of TIMP ⑹; if p n ^ swap pairs by setting 
u ^ U(0, v ^ V(0, q LINK(t), q LINK(g) ， LINK(g’）l p" ， LINK(p) ^ t, 

^ u\ Y(p f ) <- v\ LINKCp 7 ) ^ q, U ⑴ V(0 u, LINK(0 p. 

Notice that we do not make the current pairs of TIMP (D inactive* They won’t be 
accessed by the algorithm until it needs to undo the swaps just made. 

(b) In VAR and in each TIMP list, the active entries appear first- The inactive entries 
follow, in the same order as they were swapped out, because inactive entries never 
participate in swaps. Therefore we can reactivate the most - recently - swapped - out entry 
by simply increasing the count of active entries. We must, however 5 be careful to do this 
“virtual unswapping” in precisely the reverse order from which we did the swapping. 

Thus, for l = 2X + 1 and l = 2X, and for all p in TIMP(D，proceeding in the 
reverse order from (a), we set u ^ U(p), v — V(p) ， TSIZE(i)) *<— TSIZE(5) + 1， and 
TSIZE(?x) TSIZE(^) +1- 

(The number N of free variables increases implicitly, because N + E = n in 
step L12. Thus nothing needs to be done to VAR or INX.) 

138. Because v G BIMP ( 丑） ,( 62 ) will be used to make u nearly true. That loop will 
also make v nearly true，because v G BIKP(u) is equivalent to u ^ BIMP ( 云 ） • 

139. Introduce a new variable BSTAMP analogous to ISTAMP ， and a new field BST (0 
analogous to 1ST (D in the data for each literal 1. At the beginning of step L9, set 
BSTAMP 4 - BSTAMP + 1, then set BST(D — BSTAMP for / = ^ and all l G BIMP(&). Now ， 
if BST ⑹ # BSTAMP and BST(v) ^ BSTAMP, do the following for all w G BIMP(v): If w 
is fixed in context NT (it must be fixed true, since w implies v)^ do nothing. Otherwise 
if BST(w) = BSTAMP ， perform ( 62 ) with l *f- u and exit the loop on w (because u 
implies both w and w). Otherwise, if BST(w) ^ BSTAMP, append w to BIMP ( 丑） and u 
to BIKP(w). (Of course ( 63 ) must be invoked when needed.) 

Then increase BSTAMP again, and do the same thing with u and v reversed. 

140. Unfortunately, no: We might have Q(n) changes to BSIZE on each of 0(n) levels 
of the search tree- However, the ISTACK will never have more entries than the total 
number of cells in all BIMP tables (namely 2 M in answer 134) • 

141. Suppose I STAMP 卜 (ISTAMP + l)mod2 e in step L5 - If ISTAMP = 0 after that 

operation, we can safely set ISTAMP ^ 1 and 1ST (/) 0 for 2 < / < 2n +1. (A similar 

remark applies to BSTAMP and BST(D in answer 139.) 

142. (The following operations，performed after BRANCH M is set in step L2, will also 
output T to mark levels of the search where no decision was made.) Set BACKL Id] 

r 1 A: 0, and do the following while k < d: While r < BACKF [A:] ， output ‘6 + (i? r &1)’ 

and set r ^ r+1- If BRANCH [A:] < 0, outputotherwise output c 2BRANCH[A;] 
and set r r + 1. While r < BACKL [A: + 1], output ‘4 + (i? r & 1)’ and set r 卜 r + 1. 
Then set A: ^ A: + 1. 
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143. The following solution treats KINX and KSIZE as the unmodified algorithm treats 
TIMP and TSIZE. It deals in a somewhat more subtle way with CINX and CSIZE: If 
clause c originally had size and if j of its literals have become false while none have 
yet become true, CSIZE(c) will he k _ j, but the nonfalse literals will not necessarily 
appear at the beginning of list CINX (c) . As soon as j reaches k — 2^ or one of the literals 
becomes true, clause c becomes inactive and it disappears from the KINX tables of all 
free literals. The algorithm won’t look at CINX(c) or CSIZE(c) again until it unfixes the 
literal that deactivated c. Thus a big clause is inactive if and only if it has been satisfied 
(contains a true literal) or has become binary (has at most two nonfalse literals). 

We need to modify only the three steps that involve TIMP. The modified step Ll ， 
call it LI’，inputs the big clauses in a straightforward way. 

Step L7 / removes the formerly free variable X from the data structures by first 
deactivating all of the active big clauses that contain L: For each of the KSIZE (L) 
numbers c in KINX(L), and for each of the CSIZE(c) free literals u in CINX(c)，we swap 
c out of u^s clause list as follows: Set s 4 - KSIZE (w) — 1, KSIZE (?x) 5 ； find t < s with 

KINX(ix) [t] = c\\it ^ s set KINX ⑹⑷卜 KINX ⑹ [s] and KINX (u) [s] ^ c. [Heuristic: 
If the number of free literals remaining in c is small compared to c’s original size, 
for example if say 15 or 20 original literals have become false, the remaining nonfalse 
literals can usefully be swapped into the first CSIZE(c) positions of CINX(c) when c is 
being deactivated. The author’s experimental implementation does this when CSIZE(c) 
is at most 6 times the original size, where the parameter 6 is normally 25/64.] 

Then step L7 f updates clauses for which L has become false: For each of the 
KSIZE (L) numbers c in KINX (Z)，set s — CSIZE(c) - 1 and CSIZE(c) — s; if s = 2, 
find the two free literals {u^ v) in CINX(c) ， swap them into the first positions of that list ， 
put them on a temporary stack，and swap c out of the clause lists of u and v as above. 

Finally，step L7 f does step L8 7 = L8 for all v) on the temporary stack. [The 
maximum size of that stack will be the maximum of KSIZE(D over all Z，after step LI’; 
so we allocate memory for that stack as part of step LI’.] 

In step L12 7 we set L l Re, X |L|, and reactivate the clauses that involve X 
as follows: For each of the KSIZE (L) numbers c in KINX(L) ， proceeding in reverse order 
from the order used in L7\ set s ^ CSIZE(c) ， CSIZE(c) l s + 1; if s = 2, swap c back 
into the clause lists of v and where u = CINX(c) [0] and v = CINX(c) [1]. For each 
of the KSIZE(L) numbers c in KINX(L) , and for each of the CSIZE(c) free literals u 
in CINX(c), again proceeding in reverse order from the order used in L7’, swap c back 
into the clause list of u. The latter operation simply increases KSIZE (?x) by 1. 

144. False; h’ （ l) =0-1 if and only if the complement, l, doesn’t appear in any clause. 

145. By symmetry we know that h{l) = h(l) = h(10 — /) for 1 < / < 9 at depth 0, 
and the BIMP tables are empty. The first five rounds of refinement respectively give 
(h(l),...,h(5)) = (4.10,4.10,6.10,6.10,8.10), (5.01,4.59,6.84, 6.84, 7.98), (4.80,4.58, 

6.57, 6.57,8.32), (4.88,4.54, 6.72, 6.67, 8.06), and (4.85,4.56, 6.63,6.62, 8.23), slowly con¬ 
verging to the limiting values 

(4.85810213,4.55160111, 6.66761920, 6.63699698, 8.16778057). 

When d = 1, however, the successively refined values of (/i(l), /i(l), • • •, /i(4) ， h(4)) are 
erratic and divergent: (2.10, 8.70, 3.10, 6.40, 3.10, 13.00, 3.10, 10.70) ， (5.53, 3.33, 9.05, 

2.58, 5.40, 5.57, 8.24, 4.83), (1.43, 9.60, 2.32, 10.06, 1.30, 16.96, 1.97, 15.54), (8.04, 1.42, 
12.31 ， 1.29, 7.45, 2.39, 11.91 ， 1.81), (0.32, 14.19, 0.42, 15.63, 0.30, 25.67, 0.43, 24.17). 
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They eventually oscillate between limits that favor either positive or negative literals: 

(0.1012,17.4178, 0.1019,19.7351, 0.1015, 31.6345, 0.1021, 30.4902) and 

(10.3331, 0.1538,15.8485, 0.1272,9.6098, 0.1809,15.4207, 0.1542). 

[Equations ( 64 ) and ( 65 ), which were inspired by survey propagation, first appeared 
in unpublished work of S. Mijnders, B. de Wilde, and M. J. H. Heule in 2010. The 
calculations above indicate that we needn’t take h(l) too seriously, although it does seem 
to yield good results in practice. The author’s implementation also sets h f (l) <— © if 
the right-hand side of ( 65 ) exceeds a threshold parameter ©, which is 20.0 by default.] 

146. Good results have been obtained with the simple formula h(l) = s + KSIZE(0 + 
5 ^ugbimp(/) ^free K SIZE(&)，which estimates the potential number of big-clause reduc¬ 
tions that occur when l becomes true. The parameter e is typically set to 0.001. 

147. cx), 600, 60, 30, 30. 

148. If a problem is easy, we don’t care if we solve it in 2 seconds or in .000002 seconds- 
On the other hand if a problem is so difficult that it can be solved only by looking ahead 
more than we can accomplish in a reasonable time, we might as well face the fact that 
we won’t solve it anyway. There’s no point in looking ahead at 60 variables when 
d = 60， because we won’t be able to deal with more than 2 °° or so nodes in any 
reasonable search tree. 

149. The idea is to maintain a binary string SIG (x) for each variable representing the 
highest node of the search tree in which x has participated- Let bj = [BRANCH [j] = 1 ]， 
and set a t bo … bd-i at the beginning of step L2, a bo.. .bd at the beginning of 
step L4. Then x will be a participant in step X3 if and only if SIG (x) is a prefix of a. 

We update SIG(x) when x = \u\ or x = \v\ in step L9，by setting SIG(x) a 
unless SIG (x) is a prefix of a. The initial value of SIG(x) is chosen so that it is never 
a prefix of any possible cr. 

(Notice that SIG(x) needn’t change when backtracking. In practice we can safely 
maintain only the first 32 bits of a and of each string SIG(a:), together with their exact 
lengths, because lookahead computations need not be precise. In answer 143, updates 
occur not in step L9 but in step L7 ’； they are done for all literals u ♦ L that appear in 
any big clause containing L that is being shortened for the first time.) 

150. Asserting 7 at level 22 will also 22fix i，because of the clause 147. Then 1 will 
22fix 3 and 9, which will 22fix 2 and 6 , then 8 ; and clause 258 becomes false. Therefore 
7 becomes proto true; and ( 62 ) makes 3, 6 , 9 all proto true, contradicting 369• 

151. For example, one such arrangement is 

/: 2893167447613982 
o(l): 4 2 10 14 6 16 8 12 22 26 18 28 20 24 32 30 * 

[Digraphs that are obtainable in this way are called “partial orderings of dimension 
< 2^ or permutation posets. WeVe actually seen them in exercise 5.1.1-11, where the 
set of arcs was represented as a set of inversions. Permutation posets have many nice 
properties, which we shall study in Section 7.4.2. For example, if we reverse the order 
of the list and complement the offsets, we reverse the directions on the arrows. All but 
two of the 238 connected partially ordered sets on six elements are permutation posets. 
Unfortunately, however, permutation posets don’t work well with lookahead when they 
aren’t also forests. For example, after lOfixing ‘9’ and its consequences, we would want 
to remove those literals from the R stack when 14fixing ‘3 ’； see ( 71 )- But then we’d 
want them back when 6 fixing ‘！’•] 
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152. A single clause such as ‘ 12’ or ‘123’ would be an example, except that the autarky 
test in step X9 would solve the problem before we ever get to step X3. The clauses 
{123,123,123,123,245,345} do, however ， work: Level 0 branches on a; 1 ， and level 1 
discovers an autarky with b and c both true but returns 1 = 0. Then level 2 finds all 
clauses satisfied, although both of the free variables xa and X 5 are newbies. 

[Indeed, the absence of free participants means that the fixed-true literals form 
an autarky. If TSIZE(/) is nonzero for any free literal /， some clause is unsatisfied- 
Otherwise all clauses are satisfied unless some free l has an unfixed literal G BIMP(D •] 

153. Make the CAND array into a heap, with an element x of least rating r{x) at the top 
(see Section 5.2.3) - Then，while C > (7 m ax, delete the top of the heap (namely CAND [0] )• 

154. The child — > parent relations in the subforest will be d — >c — b ― c — >d^ 
and either a — >b or a — >c. Here’s one suitable sequence, using the latter: 

preorder babcddca 
2-postorder 2 10 4 8 6 16 14 12 

155. First construct the dependency graph on the 2C candidate literals，by extracting 
a subset of arcs from the BIMP tables. (This computation needn’t be exact, because 
we’re only calculating heuristics; an upper bound can be placed on the number of arcs 
considered, so that we don’t spend too much time here- However, it is important to 
have the arc u — >v if and only if v — >u is also present-) 

Then apply Tarjan’s algorithm [see Section 7.4.1，or SGB pages 512-519]- If a 
strong component contains both l and l for some /， terminate with a contradiction. 
Otherwise, if a strong component contains more than one literal, choose a representa¬ 
tive l with maximum h(l)\ the other literals of that component regard l as their parent. 
Be careful to ensure that l is a representative if and only if l is also a representative. 

The result will be a sequence of candidate literals hh .. • /s in topological order, 
with U ― > lj only if i > j. Compute the “height” of each lj^ namely the length of the 
longest path from lj to a sink. Then every literal of height h > 0 has a predecessor 
of height h — 1 ， and we let one such predecessor be its parent in the subforest- Every 
literal of height 0 (a sink) has a null parent. Traversal of this subforest in double order 
(exercise 2.3.1—18) now makes it easy to build the LL table in preorder while filling the 
L0 table in postorder. 

156. If l doesn’t appear in any clause of then A = {/} is clearly an autarky. 

157. Well, any satisfying assignment is an autarky* But more to the point is the 
autarky {1,2} for F = {123,124, 34}. 

158. BIMP(0 and TIMP(0 will be empty, so w will be zero when Algorithm X looks 
ahead on 1. Thus l will be forced true, at depth d = 0. (But pure literals that arise 
in subproblems for d > 0 won’t be detected unless they’re among the preselected 
candidates.) 

159. (a) False (consider A = {1}，F = {1 ， 2, 12}); but true if we assume that F \ A is 
computed as a multiset (so that F\ A would be {2, 2} ^ F in that example). 

(b) True: Suppose A = A f U A f, ^ A! D A f, = 0, and A" or A" touches C ^ F\A r . 

Then (7 fl ^4’ = 0 and C C' G F, where C' C X. Since A or A touches C U C\ some 
a ^ C U C f is in A; hence a ^ A f, . 

160. (a) If the gray clauses are satisfiable, let all black literals be true. [Notice ， 
incidentally，that the suggested example coloring works like a charm in ( 7 ).] 
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(b) Given any set A of strictly distinct literals, color l black if / G A, white if 
[G A, otherwise gray- Then A is an autarky if and only if condition (a) holds. 

[E. A. Hirsch ， Journal of Automated Reasoning 24 (2000) ， 397-420.] 

161. (a) If F r is satisfiable, so is F. If F is satisfiable with at least one blue literal false ， 
so is F r . If F is satisfiable with all the blue literals true, make all the black literals true 
(but keep gray literals unchanged). Then F f is satisfied, because every clause of F r that 
contains a black or blue literal is true，hence every clause that contains a white literal 
is true; the remaining clauses, whose literals are only orange and gray, each contain at 
least one true gray literal- [The black-and-blue condition is equivalent to saying that 
A is a conditional autarky ， namely an autarky of F \ L. Tseytin’s notion of “extended 
resolution” is a special case, because the literals of A and L need not appear in F. See 
S. Jeannicot, L. Oxusoff, and A. Rauzy ， Revue d’intelligence artificielle 2 (1988), 41—60, 
Section 6 ; O- Kullmann ， Theoretical Comp. Sci. 223 (1999) ， 1—72, Sections 3, 4, and 14.] 

(b) Without affecting satisfiability, we are allowed to add or delete any clause 
C = (a V 『1 V … V 匕 ） for which all clauses containing a also contain h or … or lq. 
(Such a clause is said to be “blocked” with respect to a, because C produces nothing 
but tautologies when it is resolved with clauses that contain a.) 

(c) Without affecting satisfiability，we are allowed to add or delete any or all of 
the clauses (fVai) ， •… ， ([Va p ), if A is an autarky of F| /; that is，we can do this if A is 
almost an autarky, in the sense that every clause that touches A but not A contains l. 

(d) Without affecting satisfiability, we are allowed to add or delete the clause 
(I V a) whenever every clause that contains a also contains 1. 

162. Construct a “blocking digraph” with 广 4 Z when every clause that contains 
literal l also contains l f . (If l is a pure literal, we’ll have Z’ l for all l f ] this case can 
be handled separately. Otherwise all in-degrees will be less than A: in a /cSAT problem, 
and the blocking digraph can be constructed in 0 (k 2 m) steps if there are m clauses.) 

(a) Then (/ V l f ) is a blocked binary clause if and only if / T or V ^ 1. (Hence 
we’re allowed in such cases to add both l — >l f and V ― >1 to the dependency digraph.) 

(b) Also A = {a ， a’} is an autarky if and only if a ^ a ^ a. (Moreover, any 
strong component {ai ， •… , at} with t > 1 is an autarky of size t.) 

163. Consider the recurrence relations T n = 1 + max(T n —i ， T n —‘ 2 , C/ n = 1 + 

max(T n _i,T n _ 2 , t/n-i+K-i), K = l + U n -i for n > 0, with T_i = T 0 = U 0 = V 0 = 0. 
We can prove that T n , U n ,V n are upper bounds on the step counts, where U n refers to 
cases where F is known to have a nonternary clause, and V n refers to cases when 5 = 1 
and R2 was entered from R3: The terms T n _i and T n 一 ‘2 represent autarky reductions 
in step R2; otherwise the recursive call in R3 costs t/ n — 1 ， not T n _i, because at least one 
clause contains l s - We also have V n = 1 + f/ n -i，not 1 + T n _i, because the preceding 
step R3 either had a clause containing I 2 not h or a clause containing h not Z‘ 2 . 

Fibonacci numbers provide the solution: T n = 2 F n +2 — 3 + [n = 0], U n = F n +3 — 2, 
V n = F n +2 —1- [Algorithm R is a simplification of a procedure devised by B. Monien and 
E. Speckenmeyer, Discrete Applied Mathematics 10 (1985), 287—295，who introduced 
the term “autarky” in that paper* A Stanford student, Juan Bulnes，had discovered 
a Fibonacci - bounded algorithm for 3SAT already in 1976; his method was, however, 
unattractive, because it also required Q( 0 n ) space.] 

164. If k < 3 ， T n = n is an upper bound; so we may assume that k > 3. Let 

Un = 1 + max(T 7 n _ 1 5 T n — 2 5 Un—l ^1—1 ， 1 ， • • • ， 1 + 1,/c — 2) ^ ^n，l = 1 + Un 一 1 ， 

and T4，s = 1 + max(f/ n —i ， T n 一 2 , - 1 + Vn-i， s 一 1 ) for s > 1， where V n ^ s refers to an 

entry at R2 from R3- The use of f/ n _i in the formula for V n ^ s is justified, because the 
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previous R3 either had a clause containing / 5 +i not l s or one containing l s not One 

can show by induction that V n ^ s = 5 + f/ n -i H - h U n 一 s ,U n = T4，/c - i; and T n =U n + 

[7 n —/c + 1 = 2f/ n _i + l if n > A:. For example, the running time when k = 4 is bounded by 
Tribonacci numbers, whose growth rate 1.83929 n comes from the root of x 3 = x 2 +x + l. 

165. Clause 134 in the example tells us that 1 ， 3,4 拿 A. Then 136 implies 6 ^ A. But 
A = {2, 5} works, so it is maximum. There always is a maximum (not just maximal) 
positive autarky, because the union of positive autarkies is a positive autarky. 

Each clause (m V • • • V % V v s -\-i V • • • V v s +t) of where the v^s are positive, tells 
us that vi ^ A and … and v s 味 A implies v s +j ^ for 1 < j <t. Thus it essentially 
generates t Horn clauses, whose core is the set of all positive literals not in any positive 
autarky* A simple variant of Algorithm 7.1-1C will find this core in linear time; namely, 
we can modify steps Cl and C5 in order to get t Horn clauses from a single clause of F. 

[By complementing a subset of variables, and prohibiting another subset，we can 
find the largest autarky A contained in any given set of strictly distinct literals- This ex¬ 
ercise is due to unpublished work of O- Kullmann, V. W. Marek, and M. Truszczyiiski.] 

166. Assume first that PARENT (/o) = A, so that H(lo) = 0 at the beginning of X9 
(see X 6 ). Since lo = LL [j] is not fixed in context T, we have Rf = lo by ( 62 ). 
And A = {Rf^ Rf+i, ••” 丑五 — 1 } is an autarky, because no clause touched by -A or A 
is entirely false or contains two unfixed literals. Thus we’re allowed to force lo true 
(which is what “do step X12 with l Z。” means) - 

On the other hand if w; = 0 and PARENT do) = so that H(lo) = H(p) > 0 in X 6 , 
the set A = {Rf^ ••” Re-i} is an autarky with respect to the clauses of F\p. Hence 
the additional clause (lo V p) doesn’t make the clauses any less satisfiable, by the black 
and blue principle. (Notice that (Jo V p) is already a known clause; so in this case lo is 
essentially being made equal to its parent.) 

[The author’s implementation therefore goes further and includes the step 

VAL[|/ 0 |] I VAL[|p|] ©((Z 0 ©p)&l), (*) 

which promotes the truth degree of lo to that of p. This step violates the invariant 
relation ( 71 )，but Algorithm X doesn’t rely on ( 71 ).] 

167. If a literal l is fixed in context T during the lookahead, it is implied by lo- In 
step Xll we have a case where l is also implied by To ； hence we’re allowed to force its 
truth, if l isn’t already proto true. In step X 6, 『0 is implied by /o, so lo must be false. 

168. The following method works well in march: Terminate happily if F = n. (At 

this point in Algorithm L, F is the number of fixed variables, all of which are really 
true or really false.) Otherwise find l G {LL [0] ，…， LL LS — 1] } with l mod 2 = 0 
and maximum (H(l) + .1)( 丑 (Z+l) + -l)- If l is fixed, set l 卜 0. (In that case, 
Algorithm X found at least one forced literal, although U is now zero; we want to do 
another lookahead before branching again.) Otherwise, if H{1) > 丑 (Z + l)，set / / + 1- 

(A subproblem that is less reduced will tend to be more satisfiable.) 

169. When a and b are positive, the function f(x) = e~ ax + e_ bx — 1 is convex and 
decreasing, and it has the unique root In r(a, 6). Newton’s method for solving this 
equation refines an approximation x by computing x = x + f(x)/(ae_ ax + be~ bx ). 
Notice that x is less than the root if and only if f(x) > 0; furthermore f(x) > 0 implies 
f{x) > 0, because f(x’）> f(x) + {x — x) f (x) when / is convex. In particular we have 
/(l/(a + b)) > 0, because /(0) = 1 and 0’ = l/(a + 6 ), and we can proceed as follows: 

Kl. [Initialize.] Set j k ^ 1^ x l/(ai + bi). 
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K2. [Done?] (At this point (aj ， 6 j) is the best of (ai ， 61 ) ， … ， (a/c，〜)，and e~ ajX + 
e~ bjX > 1.) Terminate if k = s. Otherwise set k ^ k ^ 1^ x l/(a/c + bk). 

K3. [Find a, /3.] If x < swap j ^ k and x ^ x . Then set a e~ ajX and 
/3 <r- e~ h ^ x， . Go to K2 if a + ^ < 1. 

K4. [Newtonize.] Set x •<— x (a — (aja-\-bj/3)^ a •<— e~ akX , /3 r •<— e- bkX , 

x x + (a +〆 一 \)/{akOL + bk/3’）, and return to K3. | 

(The floating point calculations should satisfy e u < e v and u + w < v + w when u < v.) 

170. If the problem is unsatisfiable, Tarjan’s algorithm discovers l and l in the same 
strong component. If it’s satisfiable, Algorithm X finds autarkies (because w is always 
zero), thus forcing the value of all literals at depth 0 . 

171. It prevents double-looking on the same literal twice at the same search tree node. 

172. When Algorithm Y concludes normally ， we’ll have T = BASE+LO [j] ， even though 
BASE has changed. This relation is assumed to be invariant in Algorithm X. 

173. The run reported in the text，using nonoptimized parameters (see exercise 513 )， 
did 29,194,670 double-looks (that is, executions of step Y2), and exited 23,245,231 
times to X13 in step Y 8 (thus successfully forcing lo false in about 80% of those cases). 
Disabling Algorithm Y (i) increased the running time from 0-68 teramems to 1-13 
teramems, with 24.3 million nodes. Disabling wraparound (ii) increased the time to 0-85 
teramems, with 13.3 million nodes. Setting Y = 1, which disabled wraparound only in 
Algorithm Y, yielded 0.72 teramems, 11.3 meganodes. (Incidentally, the loops of Algo¬ 
rithm X wrapped around 40% of the time in the regular mn, with a mean of 0.62 and 
maximum of 12; those of Algorithm Y had 20% wraparound, with a mean of 0.25; the 
maximum Y = 8 was reached only 28 times.) Disabling the lookahead forest (iii) gave 
surprisingly good results: 0.70 teramems, 8.5 meganodes; there were fewer nodes [hence 
a more discriminating lookahead], but more time spent per node because of duplicated 
effort, although strong components were not computed- (Structured problems that 
have numerous binary clauses tend to generate more helpful forests than random 3SAT 
problems do.) Disabling compensation resolvents (iv) made very little difference: 0.70 
teramems, 9-9 meganodes. But disabling windfalls (v) raised the cost to 0.89 teramems 
and 13.5 meganodes. And branching on a random l G LL (vi) made the running 
time soar to 40.20 teramems, with 594.7 meganodes. Finally，disabling Algorithm X 
altogether (vii) was a disaster，leading to an estimated run time of well over 10 20 mems- 

The weaker heuristics of exercise 175 yield 3.09 teramems and 35.9 meganodes. 

174. Setting y to a huge value such as PT will never get to step Y2. (But for (ii), (iii), 
… ， (vii) one must change the programs, not the parameters as they stand.) 

175. Precompute the weights, by setting K 2 = 1 and K s 7/^—1 +.01，for s between 
3 and the maximum clause size. (The extra .01 keeps this from being zero.) The third 
line of ( 72 ) must change to “take account of c for all c in KINX(L)where that means 
a set s CSIZE(c) — 1; if 5 > 2, set CSIZE(c) *<— s and w w + K s \ otherwise if all 
literals of c are fixed false, set a flag; otherwise if some literal u of c isn’t fixed (there 
will be just one), put it on a temporary stack.” Before performing the last line of ( 72 )， 
go to CONFLICT if the flag is set; otherwise, for each unfixed u on the temporary stack, 
set Wi u and i i + 1 and perform ( 62 ) with l u] go to CONFLICT if some u on the 
temporary stack is fixed false. (A “windfall” in this more general setting is a clause for 
which all but one literal has been fixed false as a consequence of lo being fixed true.) 

Of course those changes to CSIZE need to be undone; a simulated false literal that 
has been “virtually” removed from a clause must be virtually put back. Fortunately ， 
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the invariant relation (71) makes this task fairly easy: We set G ^ F in step X5, and 
insert the following restoration loop at the very beginning of (72): a While G > F, set 
u *<— Rg-i] stop if u is fixed in context T; otherwise set G G — 1^ and increase 
CSIZE(c) by 1 for all c G KINX (丑）•” The restoration loop should also be performed, 
with T NT, just before terminating Algorithm X in steps X7 or X13- 

[The additional step (*) in answer 166 can’t be used, because (71) is now crucial.] 
Algorithm Y should change in essentially the same way as Algorithm X. 

[See O- Kullmann，Report CSR 23-2002 (Swansea: Univ. of Wales，2002)，§4.2.] 


176 - (a) 

一 e 3 


-a j+ i, aj 

dj+i, 


b 3 


~+i ， 




c h bj dj ， Cj dj ， 


e 3 


/j+i，/j —— Cj+U fj —— e i+i- 


(b) Let ( tj, Uj, vj, wj, aj, bj, cj, dj, ej, fj ) have colors (1，2, 1，1， 1, 2,1， 3,3,2) when 
j is even, (2,1,2, 2, 3, 2, 3,1,1, 2) when j is odd. The lower bounds are obvious. 

(c) Vertices fj can’t all have the same color，because bj, Cj, dj have distinct 

colors. Let aj denote the colors of ajejfj. Then aj = 112 implies aj+i = 332 or 233; 


121 implies aj+i = 233 or 323; aj = 211 implies aj+i = 323 or 332 


123 


implies cxj+i = 213 or 321. Since ai = a g +i，the colors of a 1 must be distinct，and we 
can assume that ai = 123. But then aj will be an odd permutation whenever j is even. 

[See Rufus Isaacs, AMM 82 (1975)，233-234. Unpublished notes of E. Grinberg 
show that he had independently investigated the graph J5 in 1972.] 


177 . There are 20 independent subsets of Vj = {aj 5 bj , Cj 5 dj , ej , fj } when q > 1; eight 
of them contain none of {bj , Cj , dj} while four contain bj. Let A be a 20 x 20 transition 
matrix, which indicates when i?U(7 is independent for each independent subset R CVj 
and C C V}+i. Then I q is trace(A^); and the first eight values are 8, 126，1052, 
11170, 112828, 1159416, 11869768, 121668290. The characteristic polynomial of A, 
x 12 (x 2 -2x- l)(x 2 +2x- l)(x 4 - 8x 3 - 25x 2 + 20x + l), has nonzero roots 士 1 士 
and ^ —2.91，—0.05，+0.71，+10.25; hence I q = ©(〆），where r ^ 10-24811166 is the 
dominant root- Note: The number of kernels of L(J q ) is respectively 2, 32，140，536, 
2957, 14336, 70093, 348872, for 1 < g < 8, and its growth rate is « 4.93 q . 


178 . With the first ordering, the top 18k levels of the search tree essentially represent 
all of the ways to 3-color the subgraph {aj,bj,Cj,dj,ej, fj | 1 < j < A:}; and there are 
0(2^) ways to do that, by answer 176. But with the second ordering, the top 6kq levels 
essentially represent all of the independent sets of the graph; and there are D(10.2” of 
those, by answer 177- 

Empirically, Algorithm B needs respectively 1.54 megamems, 1.57 gigamems, and 
1.61 teramems to prove unsatisfiability when g = 9, 19, and 29, using the first ordering; 
but it needs 158 gigamems already for g = 5 with the second! Additional clauses, which 
require color classes to be kernels (see answer 14)，reduce that time to 492 megamems. 

Algorithm D does badly on this sequence of problems: When q = 19, it consumes 
37.6 gigamems, even with the “good” ordering. And when q = 29, its cyclic method 
of working somehow transforms the good ordering into a bad ordering on many of the 
variables at depths 200 or more- It shows no sign of being anywhere near completion 
even after spending a petamem on that problem! 

Algorithm L，which is insensitive to the ordering, needs 2.42 megamems, 2.01 
gigamems, and 1-73 teramems when g = 9，19， and 29. Thus it appears to take 0(2^) 
steps，and to be slightly slower than Algorithm B as g grows，although exercise 232 
shows that a clairvoyant lookahead procedure could theoretically do much better. 

Algorithm C triumphs here, as shown in Fig. 49. 
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179 . This is a straightforward exact cover problem. If we classify the solutions accord¬ 
ing to how many asterisks occur in each coordinate, it turns out that exactly (10, 240, 

180, 360, 720, 480 ， 1440, 270, 200, 480) of them are respectively of type (00088, 00268, 
00448, 00466, 02248, 02266, 02446, 04444, 22228, 22246). 

By complementation, we see that 4380 choices of 8 clauses are unsatisfiable; hence 
q 8 = l - 4380/( s 8 °) = 1 - 4380/28987537150 « 0.9999998. 

180. With N variables yj, one for each possible clause Cj, the function f(yi ， … ， yN)= 
[A{ C J I Vj = !} is satisfiable] is \J x f x (y), where f x (y) = [x satisfies /\{Cj \ yj = 1}] 
is simply /\{yj | x makes Cj false}- For instance if A: = 2 and n = 3, and if Ci, CV ，Cn 
are the clauses (xi V a ： 2 )，(xi V 无 3 ) ， (x 2 V 无 3 ), then fooi(yi, … ,?/i 2 ) = ^1 A ^7 A yn. 

Each function f x has a very simple BDD, but of course the OR of 2 n of them will 
not be simple. This problem is an excellent example where no natural ordering of the 
clause variables is evident, but the method of sifting is able to reduce the BDD size 
substantially. In fact, the clauses for A: = 3 and n = 4 can be ordered cleverly so that 
the corresponding 32-variable BDD for satisfiability has only 1362 nodes! The author’s 
best result for A: = 3 and n = 5, however, was a BDD of size 2,155,458. The coefficients 
of its generating function (exercise 7.1.4-25) are the desired numbers Q m . 

The largest such count, Q 35 = 3,449,494,339,791 ， 376,514,416, is so enormous that 
we could not hope to enumerate the relevant sets of 35 clauses by backtracking. 

181. The previous exercise essentially computed the generating function QmZ m ; 
now we want the double generating function m T“ m w l z m , where is the number 
of ways to choose m different A:-clauses in such a way that these clauses are satisfied by 
exactly l vectors xi ... x n . To do this，instead of taking the OR of the simple functions 
f x , we compute the BDD base that contains all of the symmetric Boolean functions 
S/(/o"_o, … ， /i_"i) for 0 < / < 2 n , as follows (see exercise 7.1.4-49): Consider the 
subscript a: to be a binary integer, so that the functions are for 0 < x < 2 n • Start 
with 5 / = 0 for —1 < / < 2 n , except that ^0 = 1 - Then do the following for x = 0, • • • ， 
2 n — 1 (in that order): Set Si = f x ^ Si^i ： Si for l = x 1, • • • ， 0 (in that order) • 

After this computation, the generating function for Si will be ^2 m T^ 7n z m . In 
the author’s experiments, the sifting algorithm found an ordering of the 80 clauses for 
k = 3 and n = 5 so that only about 6 million nodes were needed when x had reached 
24; afterwards, however, sifting took too long, so it was turned off. The final BDD base 
had approximately 87 million nodes, with many nodes shared between the individual 
functions Si. The total running time was about 22 gigamems- 

182 . To = 32 and T\ = 28 and T m = 0 for 71 < m < 80- Otherwise minT m < maxT m . 

183 . Let t m = Pr(T m = 1)，and suppose that we obtain clauses one by one until 
reaching an unsatisfiable set- The fact that t m gets reasonably large suggests that we 
probably have accumulated a uniquely satisfiable set just before stopping. (That proba¬ 
bility is N^2 m tm/ (TV —m), which turns out to be 0.8853 when k = 3 and n = 5.) 

However, except for the fact that both Figs. 42 and 43 are bell-shaped curves with 
roughly the same tendency to be relatively large or small at particular values of m, 
there is apparently no strong mathematical connection. The probabilities in Fig. 43 
sum to 1; but the sum of probabilities in Fig. 42 has no obvious significance. 

When n is large, uniquely satisfiable sets are encountered only rarely. The final 
set before stopping a.s. has at most /(n) solutions，for certain functions /; but how 
fast does the smallest such / grow? [See D. J. Aldous, J. Theoretical Probability 4 
(1991), 197—211, for related ideas-] 
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184. The probability q m is Q m /N m ^ where Q m counts the choices (Ci ， • • • ， （ 7 m ) for 
which Ci 八 • • • 八 Cm is satisfiable. The number of such choices that involve t distinct 
clauses is t\ {?} times Qt, because {?} enumerates set partitions; see Eq. 3.3.2-( 5 ). 

185. q m = Ef = o {^^)1 Ef： 0 

186 - Y,mJ2t can be summed on m, since = l/(N-l) t 

by Eq. 1.2. 9 -( 28 ). Similarly, the derivative of 1.2.9—( 28 ) shows that = 

(N/(N _ 1) + ... + N/(N- t))/(N - I) 1 -. 

187. In this special case, q m = [0 <m< N] and p m = [m = N]; hence S n , n = N = 2 n 
(and the variance is zero). By ( 78 )， we also have S n ,n = NHn\ indeed, the coupon 
collector’s test (exercise 3.3.2-8) is an equivalent way to view this situation. 

188. Now q m = 2 m n—/(2n)—. It follows by ( 78 ) that 5i, n = 2 m n m /(2n — l) m , 

because N = 2n. The identity 2 m n—/(2n —1)— = 2q m —q m+ i yields the surprising fact 

that Si^n = ( 2 ^ 0 -gi)+( 2 gi-^ 2 )H —— = 1 +Si， n ; and we also have 5i， n -l = 

Hence, by induction, we obtain the (even more surprising) closed forms 



So random IS AT problems become unsatisfiable after y / 7 m + 0{1) clauses, on average- 

189. With the autosifting method in the author’s experimental BDD implementation, 
the number of BDD nodes, given a sequence of m distinct clauses when k = 3 and 
n = 50， increased past 1000 when m increased from 1 to about 30， and it tended to 
peak at about 500,000 when m was slightly more than 100. Then the typical BDD size 
fell to about 50,000 when m = 150, and to only about 500 when m = 200. 

BDD methods break down when n is too large, but when they apply we can count 
the total number of solutions remaining after m steps. In the author’s tests with A: = 3, 
n = 50, and m = 200, this number varied from about 25 to about 2000- 

190. For example, Si(xi^... ^ x n ) can’t be expressed in (n — 1)CNF: All clauses of 
length n — 1 that are implied by Si (xi^ . •., x n ) are also implied by ^< 1 (^ 1 , •.., x n ). 

191. Let f(xo, … , X 2 n -i) = 1 if and only if xo . •. x‘ 2 打 一 1 is the truth table of a Boolean 
function of n variables that is expressible in A:CNF. This function / is the conjunction 
of 2 n constraints c ⑴, for 0 < t = ( 艺 o. •• 艺 2 n -i )2 < 2 n ，where c ⑴ is the following 
condition: If 以 = 0 ， then \J{x y \ 0 < y < 2 n ， ㊉ t) & m = 0 } is 0 for some n-bit 
pattern m that has vm = k. By combining these constraints we can compute the BDD 
for / when n = 4 and A: = 3; it has 880 nodes, and 43,146 solutions. 

Similarly we have the following results，analogous to those in Section 7-1-1: 



n = 0 

n = 1 

n = 2 

n = 3 

n = 4 

n = 5 

n = 6 

1CNF 

2 

4 

10 

28 

82 

244 

730 

2CNF 

2 

4 

16 

166 

4,170 

224,716 

24,445,368 

3CNF 

2 

4 

16 

256 

43,146 

120,510,132 

4,977,694,100,656 


And if we consider equivalence under complementation and permutation，the counts are: 


1CNF 

2 

3 

4 

5 

6 

7 

8 

2CNF 

2 

3 

6 

14 

45 

196 

1,360 

3CNF 

2 

3 

6 

22 

253 

37,098 

109,873,815 


192. (a) S(p) = Y ： ^ =0 p m (l-p) N ~ m Qm. (b) We have J Q N (t/N) m (l - t/N) N ~^ dt = 
NB(m + l,iV-m + l) = ^/Q, by exercises 1.2.6—40 and 41; hence Sk,n = 
Sm=o = ivTT^^- [See B. Bollobds ，Random Graphs (1985), Theorem IL4.] 
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194. A similar question, about proofs of wnsatisfiability when a > lim S^^n/n^ 

is also wide open. 

195. EX = 2 n Pr(0... 0 satisfies all) = 2 n (l-2 - k ) m = exp(nln2 + mln(l - 2—”) < 
2exp(—2 _A ， _ 1 nIn2). Thus 5/ c (L(2 fc In2)nJ, n) = Pr(X > 0) < exp(—0(n)). [Discrete 
Applied Math. 5 (1983), 77-87. Conversely, in J. Amer. Math. Soc. 17 (2004), 947-973, 
D. Achlioptas and Y. Peres use the second moment principle to show that {2 k In 2 — 
0{k))n random /cSAT clauses are almost always satisfiable by vectors x with vx 7^ n/2. 
Careful study of “covering assignments” (see exercise 364) leads to the sharp bounds 

2 fc ln 2 — l+J 11 2 — 0 ( 2 ~~"^) < lim inf ol^ (n) < limsupa/ c (n) < 2 fe ln 2 — ^ + 0 ( 2 _ 云 ); 

2 _ n-^oo _ n -^oc — 2 


see A. Coja-Oghlan and K. Panagiotou ， arXiv: 1310.2728 [math.CO] (2013)，48 pages-] 

196. The probability is ((n — t)-/n-) an+ °^ = e~ kta {\ + 0(l/n)) that an + 0(1) 
random /cSAT clauses omit t given letters. Let p = 1 — (1 — e~ ka ) k . By inclusion and 
exclusion, the first clause will be easy with probability p(l + 0 (l/n))，and the first two 
will both be easy with probability p 2 (l + 0(l/n)). Thus if X = [clause j is easy], 
we have EX = pm+0(l) and EX 2 = p 2 m 2 +0(m)- Hence, by Chebyshev’s inequality, 
Pr(|X-pm| > r^/Wi) = 0(l/r 2 ). 

197. By Stirling’s approximation, lng(a ， 6 , A ， B ， n) = n/(a ， 6 , A ， B) + p(a ， 6 ， A ， B)— 

\ In 27m — {8an — d( a +6)n) — (^bn — ^(b-\-B)n) ~ ( 心 n — <^(a+A)n) — (^Bn — ^{A+B)n) ~ 

J( a+ 6 +A + B)n，where S n is positive and decreasing. And we must have f(a,b. A, B) < 0, 
since g(a ， 6 , A ， B ， n) < 1. The O estimate is uniform when 0 < S < A^B < M• 

198. Consider one of the N M possible sequences of M 3SAT clauses, where N = 8(^) 
and M = bn. By exercise 196 it contains 分 = 5(1 — (1 — e 一 15 ) 3 )n + 0(n 3 / 4 ) easy 
clauses, except with probability 0{n~ 1 ^ 2 ). Those clauses, though rare, don’t affect the 
satisfiability; and all (^) of the ways to insert them among the r = M — g others are 
equally likely，so they tend to dampen the transition. 

Let l < r be maximum so that the first l noneasy clauses are satisfiable, and let 
p(Z ， r ， 分， m) be the probability that, when drawing m balls from an urn that contains g 
green balls and r red balls, at most l balls are red. Then Ssim,n) = ^2p{l ， r, g, m)/N M 
and 53 ( 爪’， n) = g,m!) /N M , summed over all N M sequences. 

To complete the proof we shall show that 

p(Z ， r, 夕 ， m + 1) = g ， m) — 0(n 一 " 2 ) when 3.bn < m < 4.5n; 

hence Ss(m-\- 1 , n) = Ss(m^ n) — 0 (n -" 2 ) ， n) — Ss(m f ^n) = 0((m’ — m)n - 1 ")• 
Notice that g, m) = m+1) when m < l or m > 1 + g\ thus we may assume 

that l lies between 3.4n and 4.6n. Furthermore the difference 


dm = - p(l, r ， p，m + 1) 




r 


r r 9 ) 


O r + g-m 


has a decreasing ratio dm/dm-i = (m/ (m — /))((/ + ^ + 1 — m)/{r + g — m)) when m 
increases from Z to Z + 分 • So maxd m occurs at m ^ / (r + g) 卜 , where this ratio is s 1. 
Now exercise 197 applies with a = //n, b = pg/n^ A = (r—V)/n,B = (l—p)g/n^ p = l/r. 

[D. B. Wilson，in Random Structures & Algorithms 21 (2002) ， 182—195, showed 
that similar methods apply to many other threshold phenomena.] 
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199. (a) Given the required letters {ai，. • • ， a，}，there are m ways to place the left¬ 
most ai，then m — 1 ways to place the leftmost a‘ 2 , and so on; then there are at most 
N ways to fill in each of the remaining m — t slots. 

(b) By inclusion and exclusion: There are (N—k) m words that omit k of the letters. 
(C) N-^Z k =Zj (7)(-1)^^^-, where A,= 

Ylk GK- 1 广 w = O! by Eq. l 2 .6-( 53 ). 

200• (a) The unsatisfiable digraph must contain a strong component with a path 

It — “l — > —> It — ^ h+i — >• • • • — >li = 及， 

where / 1 , • • ” It are strictly distinct. This path yields an 5-snare ((7; ^ u) if we set s to 
the smallest index such that |“ +1 | = \l u \ for some u with 1 < u < s. 

(b) No: (x V t/) A V x) A (x V y) and (x\/ y) A (y\/ x) A(x\/ y) are both satisfiable. 

(c) Apply exercise 199(a) with t = s + 1，TV = 2n(n— 1); note that m— < m s+l . 

201. (a) Set (/i, h+i) •<— (^ 1 ,^ 2 ) or ( 无 2 ,无 1 )，where 0 <i < 2t (thus At ways). 

(b) Set (L ， L+i ， 夂 + 2 ) l ($ 1 ，尤 2 ，尤 3 ) or ( 无 3 , 无 2 , 无 1 )，where 0 < i < 2t; also 

or (h-i ， 心，七 （尤 1 ， _ 尤 2 ， $ 3 ) or ( 无 3 , 无 2 ,无 1 ) (total + 4 ways, if t>2). 

(c) (hjt-ijt) or (xi,x t -i,x t ) or (x t -i,xi,x t ) (4 ways). 

(d) k or ht - i ^Xi or xt-i^ for l<i<t (4 ways, if you understand this notation). 

(e) By part (a) ， it is 2t x 4t = 8t 2 . 

(f) Parts (b) and (c) combine to give iV(3,2) = (2t + 2) x (4t + 4) + 2 x 4 = 

8(t 2 + + 2) when t > 2. From part (d) ， N(t^ t) = 8 . Also N(2t — 1, 2t) = 8 ; this is 

the number of snakes that specify the same 2t clauses. (Incidentally, when t = b the 
generating function ^2 q ^ r N(q,r) w q z r is l-\-200w 2 z 1 (296w 3 76SSw 4 )z 2 + (440w; 4 + 

12800w; 5 + 55488 切 6 ) 之 3 + (640w 5 + 12592w; 6 + 66560w 7 + 31104w; 8 )^ 4 + ( 8 w 5 + 736w 6 + 
8960w 7 + 32064w; 8 + 6528w 9 )^ 5 + (32w 6 + 704w 7 + 4904w 8 + 4512w 9 )^ 6 + (48w 7 + 
704w 8 + 1232w 9 )z 7 + (64 忉 8 + 376w 9 ) 之 8 + 80w 9 z 9 + Sw 9 z 10 .) 

(g) The other Vs can be set in at most 2 2t ~~ 1 ~ q (n — q) 2t ^ l ^ q = R/(2 q n^) ways. 

(h) We may assume that r < 2t. The r chosen clauses divide into connected 

components, which are either paths or a “central” component that contains either 
(xo V xi) and (xt-i V xt) or (xt V xt+i) and V xo). Thus q equals r plus the 

number of components, minus 1 if the central component includes a cycle. If the 
central component is present, we must set It xt or xt^ and there are at most 8 ways 
to complete the mapping of that component. And N(r^r) = 16(r + l —t) for t < r < 2t. 

Cases with k > 0 paths can be chosen in at most ( 2 ^ 2 ) ways, because we choose 
the starting and ending points，and they can be mapped in at most 2 k k \ 〔 2 二 2 ) ways; 
so they contribute X^ 〉。 0(t 4 /c 々 /(A:! 3 ?^)) = 0{t A /n) to (2n) r p r . The noncyclic central 
components, which can be chosen in 0 (t 4 ) ways, also contribute 0{t A /n). 

202 . (a) m(m—l)... (m—r-\-l)/m r > ( 1 —(=)/m); ( 2 n(n— 1 )—r) m_r /( 2 n(n—l)) m_r > 
1 — (m — r)r/(2n(n — 1 )) when r < m < 2 n(n — 1 ); and both factors are < 1 . 

(b) The term of ( 95 ) for r = 0 is 1 plus a negligible error. The contribution of 
0{t A /n) for r > 0 is 0(n 4 / 5+1 ’ 6_1 ), because E r>0 (l + n _1 ^ 6 ) _r = n 1 ’ 6 + 1 . And 
the contributions of ( 96 ) to ( 95 ) for r > t are exponentially small, because in that 
range we have (1 + n~ 1 / 6 )~ t = exp(—t ln(l+n-" 6 )) = exp(—^(n 1 / 30 )). Finally, then, 
by the second moment principle MPR—( 22 ) ， S 2 ([n + n°^\n) < 1 — Pr(X > 0) < 
1 - (EX)7(EX 2 ) = 1 - 1/((EX 2 )/(EX) 2 ) = 1 - 1/(1 + 0(n _1/30 )) = 0(n~ 1/30 ). 

203. (a) EX = EX(1，• • • ， 1)，by symmetry; and EX(1 ， •… ， 1) = (1 — p) m , because 

each set of q clauses is falsified with probability p. So EX = exp((rln(l —p) + l)nln d) 
is exponentially small when r ln(l — p) + 1 < 0; and we know that Pr(X > 0) < EX. 
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(b) Let 0 S = (2) / (2) = n(n^-i) ^ an d consider a random constraint set，given Tovey 

that JC(1 ， ...,1) = ]_• With probability Q s , both u and v have color 1 and the 
constraint is known to be satisfied. But with probability l — 6 s /it holds with probability 

(^' 2 )/(^ -1 )- Thus p s = (Os + (1 - Os)(d 2 -pd 2 - 1)/(/ - l)r. 

(c) We have Pr(X > 0) > d n (1 —p) m /E(X | X(l, … ， 1) = 1), from the inequality 

and symmetry; and the denominator is X^=o C)(d — 1 广一 3 外 . can replace % by 
the simpler value p s = (1 — p + ps 2 /n 2 ) m ^ because p s < (0 S + (1 — 0 5 )(1 — p)) m = 

(1 — p -h 0 s p) m < p's- And we can divide the simplified sum by d n \l — p) m . 

(d) We have = e o ( ^) ( ： )( i )5(l _ because s 2 /n 2 = 

0{l/d 2 ) when s < 3n/d. This sum is > 1 — (e 2 /27) n " by exercise 1.2.10-22; and the 
crucial assumption that a > | - makes m/d 2 —> 0 . 

(e) Transition between increase and decrease occurs when x s ~ 1 ； and we have 

n — s 1 ( 、 ( 2 s + l)p \ m A 1—(7 ( 2 pra . A 

^- 7 TT^Tl 1+ ^ exp^ln —- + - lj In^J 

when s = an. Let f(a) = 2pra/(l — p + pa 2 ) — 1, and notice that /’(<j) > 0 for 

0 < a < 1 because p < |. Furthermore our choice of r makes /(|) < 0 < /(l)- 

Setting g(a) = f(a)/\n we seek values of a with g(a) = l/\nd. There are three 

such roots, because g(l/N) ^ —f(0)/\nN > 1/lniV; 士 1/iV) 义干 J(*)iV/4; and 
g(l-l/N)^f(l)/\nN. ~ 

(f) At the second peak, where s = n — n/d^ l \ we have (see exercise 1-2.6-67) 

ts< =exp ((- e + Od/d /( 1 ) ))nlnd), 

which is exponentially small. And when s = 3n/d, t s < ( 晋 ） s e 0 ( m " 2 ) = 0((e/3) 3n ") 
is also exponentially small. Consequently ^2^ =3n / d t s is exponentially small- 

[This derivation holds also when the random constraints are A:-ary instead of 
binary, with q = pd k and a > 1/k. See J. ArtiRcial Intelligence Res. 12 (2000) ， 93—103.] 

204. (a) If the original literals 士 that involve variable Xj correspond to criX^i^ 

(T p X^ p ^ with signs add the clauses (—crhXi^W a h +X i( ^ h +^) for 1 < h < p to enforce 
consistency, where h + = l+(/i modp). (This transformation, due to C- A. Tovey, works 
even in degenerate cases- For example, if m = 1 and if the given clause is (xi W xiW x‘ 2 )， 
the transformed clauses are {X\ V X 2 V Xs)^ (Xi V X 2 ), (X 2 V X\)^ (Xs V Xs).) 

(b) After Fo = {e}, F\ = Fo LJ Fo, F 2 = Fo U Fi, F 3 = Fo U -F 2 , F 4 = F s U 

F 5 = F 4 U , always putting the new variable into the four shortest possible clauses, 

we get F 5 = {345, 234, 123, 123, 3’ 】 5, 2’3’4, 1 , 2 , 3 , , 3"4"5, 2"3"4" ， l’W ’， 

3 ， "2"§, 2H", 

(c) If we delete 123 from F 5 there are 288 solutions，namely 1 八 2 八 3 八 4 八 5 八 c ’ 八 
(4"? c" A 3 ，"： c" ， 八 3"), where c = 2 V 3. 

(d) Add [m/ 2 ] disjoint clones of the 15 clauses of (c) to the 4m clauses of (a )， 
giving m + 15[m/2] 3 - clauses and 3m 2-clauses that are satisfiable only if all literals 
cloned from 2, or 3 are false. Each clone provides six such false literals {1 5 2, 2, 3} 
without using any variable five times. So we can stick those literals into the 2 - clauses, 
obtaining ^ 11.5m 3-clauses in iV » 10.5m variables. (The new clauses have 288「 m / 2 1 
times as many solutions as the original ones. Can the ratio N/m ^ 10-5 be lowered?) 

205. Let Fo = {e}, F\ = Fo U Fo, F 2 = Fo U Fi, Fs = Fo U F 2 , F 4 = Fo U 

F 5 = Fi U Fq = Fo U F 5 , F 7 = FqU Fg = F 4 LI Fg = Fo U F 10 = F 7 U Fq^ 
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Fn = F 7 U F{ 0l F 12 = F 0 U Fh, Fis = F 9 U F 12 , F 14 = F 10 LJ 巧 5 = F 12 U , 

Fi 6 = F 1S U F^\ F 17 = F 14 U F[l\ F 18 = F 16 U i^ 13 ). (Here c x (3), stands for V"，, 
etc.) Then Fig consists of 257 unsatisfiable 4-clauses in 234 variables* 

(Is there a shorter solution? This problem was first solved by J- Stribrna in her 
M.S. thesis (Prague: Charles University, 1994), with 449 clauses- The U method was 
introduced by S. Hoory and S. Szeider ，Theoretical Computer Science 337 (2005), 347— 
359， who presented an unsatisfiable 5SAT problem that uses each variable at most 7 
times. It’s not known whether 7 can be decreased to 6 when every clause has size 5.) 

206. Suppose F and F f are minimally unsatisfiable, and delete a clause of FU F f that 
arose from F f •, then we can satisfy F U F f with x true- 

Conversely, if FUF r is minimally unsatisfiable, F and F r can’t both be satisfiable- 
Suppose F is unsatisfiable but F r is satisfied by L r . Removing a clause of F U F r 
that arose from F r is satisfiable only with x true; but then we can use 1/ to satisfy 
F U F f . Hence F and F f are both unsatisfiable- Finally，if F \ (7 is unsatisfiable, so is 
(F U F f ) \ {C I x)^ because any solution would satisfy either F\C or F f . 

207. The five clauses of (7(x ， 仏么； a ， 6 , c) = {xab^ ybc, zca, abc, abc] resolve to the single 
clause xyz. Thus C(x^ " ， 1 ， 2, 3) U (7(x ， 仏沒 ; 4, 5, 6 ) U (7( 无， z ， 么 ; 7, 8 , 9) U (7( 无，乏，乏 ; a ， 6 , c) 
is a solution. [K. Iwama and K. Takaki ，DIMACS 35 (1997), 315—333, noted that the 
16 clauses {xyz} U (7(x, x ， 1 ， 2, 3) U (7(y ， y ， 4, 5, 6 ) U ( 7 ( 2 ：， 之 ; 7, 8 , 9) involve each 
variable exactly four times, and proved that no set of twelve clauses does so.] 

208. Make m clones of all but one of the 20 clauses in answer 207, and put the other 3m 
cloned literals into the 3m binary clauses of answer 204(a). This gives 23m 3 - clauses 
in which every literal occurs twice, except that the 3m literals X{ occur only once- 

To complete the solution, we “pad” them with additional clauses that are always 
satisfiable. For example, we could introduce 3m more variables Ui, with new clauses 

for 1 < i < 3m and { u 3 ju 3 j + ! u 3 j +2 , u^ju 3 j +1 u 3 j +2 } for 1 < j < m (treating 
subscripts mod 3m)，where u\ denotes {i even? m ： Ui). 

209. Since the multiset of kt literals in any t clauses contains at least t different vari¬ 
ables 5 the “marriage theorem” (Theorem 7.5-1M) implies that we can choose a different 
variable in each clause, easily satisfying it- [Discr. Applied Math. 8 (1984) ， 85-89.] 

210. [P- Berman, M. Karpinskiy A. D. Scott, Electronic Colloquium on Computational 
Complexity (2003), TR22.] This answer uses the magic number e = S 7 ^ 1/58, where 
S is the smallest root ofJ((l—J 7 ) 6 + (l — J 7 ) 7 ) = l. We will assign random values to 
each variable so that Pr[all clauses are satisfied] > 0. 

Let r/j = (1 — ey /((l — sy + (1 — e) 13_J ), and observe that rjj < <5(1 — for 
0 < j < 13. If variable x occurs d + times and x occurs d 一 times, let x be true with 
probability rj d -, false with probability 1 — r\ d - = rj 13 _ d - < J(1 — e) 12, ~ d ~ < <5(1 — e) d+ . 

Let bad((7) = [clause C is falsified by the random assignment], and construct 
the lopsidependency graph for these events as in exercise 351. Then, if the literals 
of (7 = (Zi V … V Z 7 ) have contrary appearances in di, … ，办 other clauses, we have 

Pr(bad((7)) < 0(1 -e 产） … (J(l _ 幻和） = £ {l-e) dl+ - +d7 < s(l-s) desree(c \ 

because C has at most di H -+ d 7 neighbors. Theorem L, with parameter Oi = e for 

each event bad((7)，now tells us that Pr[all m clauses are satisfied] > (1 — e) m . 

[See H. Gebauer, T. Szabo, and G. Tardos, SODA 22 (2011), 664-674, for 
asymptotic results that apply to /cSAT as k —>■ oo.] 
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211. If m clauses in n variables are given, so that 3m = let N = 8n. Consider N 
“colors” named jk or jk^ where 1 < j < n and k is one of the four clauses that contains 
士 Xj. Let a be a permutation on the colors, consisting of 4-cycles that involve the same 
variable, with the properties that (i) (jk)a = jk’ for some k r and (ii) (jk)a = (jk)a. 

There are An vertices of Kn named jk, having the respective color lists 

= {jk,jk}, L(jk, 2) = {jk, (jk)a}, L(jk,3) = {jk, (jk)a}. 

The other 3m vertices of Kn are named a/c, bk^ Ck for each clause k. If that clause is, 
say, X 2 V X 5 V the color lists are 

L(a/c, 1) = {2A: ， 5A: ， 6A:}, L(6/c ， 1) = L(Ck, 1) = {2A: ， 2A: ， 5A: ， 5A:, 6A: ， 6A:}; 

L(a k ,2) = {(2^V), L(b k ,2) = {(5^}, L(c k ,2) = {(6^V); 

L(ak, 3) = {(2k)a 2 , (2k)aj, L(bk, 3) = {(5k)a 2 , (5k)crj, L(ck,3) = {(6k)a 2 , (6k)aj. 

Then Kn □ K 3 is list-colorable if and only if the clauses are satisfiable- (For example, 
(jk^ 1) is colored jk 1) is colored (jk)cr (a/c ， 1) is not colored jk.) 

212. (a) Let Xijk = 1 if and only if Xij = k. [Note: Another equivalent problem is to 
find an exact cover of the rows { {Pij ， Rik^ Cjk} \ pij = rik = Cjk = l}- This is a special 
case of 3D matching; see the discussion of sudoku in Section 7.2.2.1. Incidentally, the 
3D matching problem can be formulated as the problem of finding a binary tensor (Xijk) 

1100\ /1010、 

(b) c 3 i = C32 = ri 3 = ri 4 = 0 forces xi^ =0^pi 3 when r = c= K p= - 

\1001 / \0011/ 

(c) Make L(I, J) = {1,.... N} for M < I < N, 1 < J < N. It is well known 
(Theorem 7.5.1L) that a latin rectangle can always be extended to a latin square. 

(d) Index everything by the set {1，• • • ， TV} U 1J 7 J^K) \ K ^ L(J ， J)}. The 
elements (I ， J, K) where K = min L(/, J) are called headers. Set pij = 1 if and only 
if (i) i = j = (I, J, K) is not a header; or (ii) i = (/ ， J ， K) is a header, and j = J or 
j = (I,J, K f ) is not a header; or (iii) j = (/， J, K) is a header, and i = I or i = (J, J, K’) 
is not a header. Set rik = ca = 1 if and only if (i) I < i^k < N] or (ii) i = (I, J, K) 
and k = (/， J, K’）, and if i is not a header then (K f = K or K f is the largest element 
<K in L(I,J)). [Reference: SICOMP 23 (1994), 170-184.] 

213. The hinted probability is (1 — (1 — p ) n， (1 — q) n ~ n ') m ^ where n = b\ + ... + 6 n . 
Thus if p < every x has probability at least (1 — (1 — p) n ) m of satisfying every 
clause. This is huge, unless n is small or m is large: If m is less than a n , where 
a is any constant less than 1/(1 — p)，then when n > —1/ lg(l — p) the probability 
(1-(1- p) n ) m > exp(a n ln(l - (1 - p) n )) > exp(-2(a(l - p)) n ) > 1 - 2(a(l - p)) n is 
exponentially close to 1- Nobody needs a SAT solver for such an easy problem. 

Even if, say, p = q = k/(2n)^ so that the average clause size is /c, a clause is 
empty —— hence unsatisfiable —— with probability e_ k + 0(n~ 1 ); and indeed a clause has 
exactly r elements with the Poisson probability e~ k k r /r\ + 0(n _1 ) for fixed r. So the 
model isn’t very relevant- [See J. Franco, Information Proc. Letters 23 (1986), 103-106.] 

214. (a) T(z) = ze z -1- 2T(pz)(e (1 ~ p)z - 1). 

(b) If f(z) = — e( p-1 ) z ’ pm ) and r(z) = f(z)T(z)e~ z , we have r(z)= 

zf(z) + 2r(pz) = zf(z) + 2pzf(pz) + 4p 2 zf(p 2 z) + .... 

(c) See P- Jacquet, C. Knessl, and W- Szpankowski, Com bin atorics^ Probability, 
and Computing 23 (2014) ， 829-841. [The sequence (T n ) was first studied by A. T. 
Goldberg, Cour ant Computer Science Report 16 (1979), 48-49.] 


such that x^k < Vijk and Xi^ = = 1， given {yijk)] 
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215. Since any given xi ... xi is a partial solution in ( 8 Q) — ①广 of the ( 8 (:)) m 
possible cases, level l contains Pi = 2’(1 — nodes on the average- When m = 

An and n = 50, the largest levels are (P 31 ， P 32 , … ， P?>q) ~ (6.4, 6.9, 7.2, 7.2, 6 . 8 , 6.2) x 

10 ， and the mean total tree size Po H - + P 50 is about 85.6 million. 

If I = 2tn and m = an we have Pi = 2 ⑽ n , where f(t)= 2t+alg(l —t 3 ) + 0(l/n) 
for 0 < t < 1/2. The maximum f(t) occurs when ln4 = 3at 2 /(1 — t 3 )^ at which point 
t = t a = /3 — |/5 4 + |^ 7 + 0(/3 10 ), where p = y^ln4 / (3a); for example, ^ 0.334. Now 

PL+k = 2 E , E = (2i a +alg(l-^))n+l-2^-^+o(-)+o(^), 7 = , _ ln2 , 

Tl \ Tl / \ Tb / JLiOlAjQj 


when L = 2t a n] hence the expected total tree size is 〜 + 0(l/^/n)). 

[This question was first studied by C. A. Brown and R W. Purdom ， Jr .，SICOMP 
10 (1981) ， 583-593; K. M. Bugrara and C. A. Brown, Inf. Sciences 40 (1986), 21-37.] 


216. If the search tree has q two-way branches, it has fewer than 2nq nodes; we shall 
find an upper bound on Eg. Consider such branches after values have been assigned 
to the first l variables xi, • • • ， xi^ and also to 5 additional variables yi) ••” y s because 
of unit-clause forcing; the branch therefore occurs on level t = 1 + s. The values can be 
assigned in 2 f ways，and the y^s can be chosen in ( n "" 5 1_/ ) ways- For 1 < i < s the m 
given clauses must contain ji > 1 clauses chosen (with replacement) from the F = ( 士 ； 1 ) 
that force the value of yi from other known values. The other m — ji ―… 一 j s must 
be chosen from the R = 8 (:) — sF — ( 墓） 一 2(^) (n — t) remaining clauses that aren’t 
entirely false and don’t force anything further. Thus the expected number of two-way 


branches is at most 


Pit = 2 


n—l—1 

s 


.E 

31 ，•••，、>1 


m 

ji ， … ， js ， rn—j 


F J R 


m — 


N 


m 


， j = ji +- js, N = 8 


( 


n 


summed over 0 < l < t < n. Let b = F/N and c = R/N; the sum on ji， ••” is 


I r mi ( bz -1 \5 

ml [z \ (e — 1) e 


s cz 


EC)(-d 


s — r 


r 


r 


(…(汀. 

q 


These values Pit are almost all quite small when m = 200 and n = 50, rising above 100 
only when l > 45 and t = 49; ^ Pit ^ 4404.7. 

Ifl = xn and t = yn^ we have b « |?/ 2 /n and c ^ 1— |(3( 沒一尤 ) 沒 2 +" 3 +6" 2 (1 — "))• 

The asymptotic value of [z an ] [eP z ^ n — \') Sn e lz can be found by the saddle point method: 
Let C satisfy /3Se^/(e^ — 1) + 7 = a(5jQ, and let p 2 = a/Q 2 — de^/(e^ — l) 2 . Then the 
answer is approximately (e^ — l) 6n e lC, ^y/n// (3) an+l ) . 

[For exact formulas and lower bounds, see SICOMP 12 (1983) ， 717-733 - The total 
time to find all solutions grows approximately as (2(|) a ) n when a < 45, according to 
H.-M. Mejean, H. Morel, and G. Reynaud, SICOMP 24 (1995), 621-649.] 

217. True, unless both l and l belong to A or to B (making A or B tautological). For 
if L is a set of strictly distinct literals that covers both A and we know that neither 
A nor B nor L contains both l and l; hence L\{1^ 1} covers (A\{l^ /})U (B\{1^ l}) = C. 

(This generalization of resolution is ， however，useless if C D A or C ^ because 
a large clause is easier to cover than any of its subsets. Thus we generally assume that 
l G A and IG B, and that C isn’t tautological, as in the text.) 

218. xl B: A. [Hence (x V A) A (x V B) always implies A\J B. 
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219. If C r or C n is tautological (p)，we define poC = Cop = C. Otherwise, if there’s 
a unique literal l such that C r has the form l\ZA’ and C" has the form fvA", we define 
C f o C n = A / WA ff as in the text. If there are two or more such literals, strictly distinct, 
we define C' o C" = p. And if there are no such literals, we define C' o C" = C, M C" • 

[This operation is obviously commutative but not associative. For example, we 
have (x oy) o (x\/ y) = p while x o (y o (x \/ y)) = e.] 

220. (a) True: If (7 C (7 / and C' C C" and C" ^ p then C' ^ p; hence every literal 
of C appears in C f and in C" • [The notion of subsumption goes back to a paper by 
Hugh McColl, Proc. London Math. Soc. 10 (1878), 16-28.] 

(b) True: Otherwise we’d necessarily have (C oC f ) W a W a + p and C ^ p and 
C' 半 p and C o C' ^ (7 V (7 ’； hence there’s a literal l with C = / V^4, C' = l\I A!, 
and the literals of A W A f W a W a are strictly distinct. So the result is easily checked, 
whether or not a or a contains l or 1. (Notice that we always have C oC f C (7 V C f .) 

(c) False: xy ^ p but xoxy = y<^x = xop. Also e C x but xoe = x ^ e = xox. 

(d) Such examples are possible if C ^ e: We have x^x h y (and also x^x h p), 
although the only clauses obtainable from x and x by resolution are x, x, and e. (On 
the other hand we do have F \- e if and only if there’s a refutation chain ( 104 ) for F.) 

(e) Given a resolution chain C[^ ••• ， C f m+r ^ we can construct another chain 
Ci， …, Cm+r in which ft C (7* for 1 < i < m + r. Indeed ，if i > m and C\ = Cj oC’ k , 
it’s easy to see that either Cj o Cj or Ck o Ck or Cj o Ck will subsume C[. 

(f) It suffices by (e) to prove this when ai = … = a m = a\ and by induction 
we may assume that a = / is a single literal. Given a resolution chain (7i，• • • ， C m -\- r 
we can construct another one C[^ ••” C f m+r such that C[ = Ci W l for 1 < i < m and 
C\ C Ci\/l for m + 1 < i < m+r, with Cl = Cj or C' k or Cj oC' k whenever C{ = Cj oCk- 

221 . Algorithm A recognizes ‘1， as a pure literal，but then finds a 
contradiction because the other two clauses are unsatisfiable. The 
resolution refutation uses only the other two clauses- (This is an 
example of an unnecessary branch. Indeed, a pure literal never appears 
in a refutation tree, because it can’t be canceled; see the next exercise.) 

222. If A is an autarky that satisfies (7, it also satisfies every clause on the path to e 
from a source vertex labeled (7, because all of the satisfied literals cannot simultaneously 
vanish. For the converse, see Discrete Appl. Math. 107 (2000) ， 99—137, Theorem 3-16. 

223• (The author has convinced himself of this statement, but he has not been able to 
construct a formal proof.) 

224. At every leaf labeled by an axiom A of F | x that is not an axiom of change 
the label to A\J x] also include x in the labels of all this leaf’s ancestors. We obtain a 
resolution tree in which the leaves are labeled by axioms of F. The root is labeled 

if any labels have changed; otherwise it is still labeled e. 

[See J. A. Robinson ，Machine Intelligence 3 (1968), 77-94.] 

225. Let’s say that a regular resolution tree for clause A is awkward if at least one of 
its nodes resolves on one of the variables in A. An awkward tree T for A can always be 
transformed into a regular non-awkward tree T f for some clause A / C where T r is 
smaller than T. Proof: Suppose T is awkward, but none of its subtrees are. Without 
loss of generality we can find a sequence of subtrees Tb, … ， T p , T (, …， where 
To = T and Tj 一 1 for 1 < j < p is obtained from Tj and Tj by resolving on the variable 
Xj] furthermore x p G A. We can assume that the labels of Tj and Tj are Aj and 
where Aj = Xj U Rj and A f j = Xj U Rj] hence Aj-i = Rj U R f j. Let B p = A p ] and for 
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j = p _ 1, p — 2, • • •, 1, let Bj = Bj-^i if Xj ^ Bj+i, otherwise obtain Bj by resolving 
Bj+i with Aj. It follows by induction that Bj C x p U Aj-i. Thus Bi C x p U Ao = 
and we’ve derived B\ with a non-awkward tree smaller than T. 

Now we can prove more than was asked: If T is any resolution tree that derives 
clause and if A\J B is any clause that contains there’s a non-awkward regular 
resolution tree T r no larger than T that derives some clause C C A\J B. The proof 
is by induction on the size of T: Suppose A = A f U A /f is obtained at the root of T 
by resolving the clauses xU A f with x U A ff that label the subtrees T f and T" • Find 
non-awkward regular trees T 二 and T f r f that derive C r and C", where C r C A 1 \JB and 
C" C x U A /7 U B. If x ^ C f and x ^ C r \ we obtain the desired T r by resolving T f r and 
T f r f on x. Otherwise we can either let C = C f and T r = T“ or C = C” and T r = T’ r ’• 
[It’s interesting to apply this construction to the highly irregular resolutions in ( 105 ).] 

226. Initially a is the root ， C(a) = e ， ||a|| = iV，and 5 = 0. If a isn’t a leaf，we 
have C(a) = C(a f ) o C{a n ) where x G C{a r ) and x G C{a ,r ) for some variable x. The 
Prover names and changes a a or a i- a" if the Delayer sets x 0 or x 1, 
respectively. Otherwise min(||a’|| ， ||a // ||) < ||a||/2，and the Prover can keep going. 

227. The proof is by induction on the number of variables 5 n: If F contains the empty 
clause, the game is over，the Delayer has scored 0, and the root is labeled 0. Otherwise 
the Prover names and the Delayer considers the smallest possible labels (m, m) on 
the roots of refutations for F \ x and F | x. If m > m\ the reply x 0 guarantees 
m points; and the reply x ^ * is no better, because m 7 + 1 < m. If m < the 
reply x *<— 1 guarantees m ’； and if m = m’，the reply x ^ * guarantees m + 1. Thus 
an optimum Delayer can always score at least as many points as the root label of any 
branch of a refutation tree constructed by the Prover. Conversely，if the Prover always 
names an optimal the Delayer can’t do better. 

(This exercise was suggested by O. Kullmann. One can compute the optimum 
score “bottom up” by considering all 3 n possible partial assignments as in answer 133.) 

228. We need only assume the transitivity clauses of ( 100 ) when i < j and k < j. 
[Notice further that Tijk is tautological when i = j ot k = j, thus useless for resolution.] 

229. Using the binary-relation interpretation, these clauses say that j ^ that the 
transitive law u i -< j and j k implies i -< holds whenever i < k and j < and 
that every j has a successor such that j -< k. The latter axiom combines with the 
finiteness of m to imply that there must be a cycle jo ^ ji 弋 • • • 弋 j P -i -< j P = jo- 

Consider the shortest such cycle, and renumber the subscripts so that j p = 
max{jo, …， jp}. We cannot have p > 2, because ( 100 ’）implies j p 一 ‘2 j P , yielding 
a shorter cycle. Hence p = 1; but that contradicts ( 99 ). 

230. Call the axioms Ij, Tijk, and Mj m as in the text. If is omitted, let Xij = [j = jo] 

for all i and j. If Ti 0 j 0 k 0 is omitted, let Xij = [j G A] for dll i ^ A = {< 0 , jo ， fo)}; also 
let Xi 0 j = [J. = Jo ]，^joj = [j = and (if io ¥ 众 0 ) = [j = ^ 0 ]• Finally，if MjQ m 

is omitted，let Xij = [pi <Pj]^ where pi.. .p m = 1 ••- (jo—l)(jo+l) … mjo. (The same 
construction shows that the clauses of answer 228 are minimally unsatisfiable-) 

231. Since Gn = Mi m , we can assume that j > 1. Then ◊ Ij 一 [ 

And if 1 < i < j — 1 we have Gij = (• • • o Aijj) o <>•••) oAij m ^ where 

Aijk = oT^j_i^ k = Gij V 元 (j 一 作 . These clauses make it possible to derive 

Bij =( … ((Gij oTjij) oT^(j + i)) ◊… ） oTji m = Gjj V Xji for 1 < i < j, from which we 
obtain Gjj = (• • • ◊ Bij) ◊ B* 2 j ) ◊•••)◊ • Finally Gmm ^ Imm = 
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232. It suffices to exhibit a backtrack tree of depth 6 lg ^ + 0(1). By branching on at 
most 6 variables we can find the color-triplet ai in answer 176(c)- 

Suppose we know that aj = a and cxj-^p = a’，where a cannot be obtained 
from a in p steps; this is initially true with j = 1, a = c/ = ai，and p = q. If 
p = 1， a few more branches will find a contradiction. Otherwise at most 6 branches 
will determine where l = j + Lp/ 2 」； and either ai will be unreachable from a in 
[p/2 」 steps, or a will be unreachable from ai in [p/2] steps, or both. Recurse- 

233. C9 = C4 o C^ 8 , Cio = Ci o C 9 , Cn =0^0 Cio, C12 = Cq o fto, C12, = C7 o Cn, 
C14 = C3 O Ci2^ C15 = C13 O Ci 4 ^ Ci 6 = C2 O Ci 5 ^ C17 = C4 O Ci 5 ^ Cig = Cg O Ci 5 ^ 
C19 = Cl‘2 ◊ Cl 7 ^ C20 = Cll O Cl 8 ^ C2I = Cl 6 O Cl 9 ^ C22 = C20 O ^21. 

234. Reply Xjk ^ * to any query that doesn’t allow the Prover to violate ( 107 ). Then 
the Prover can violate ( 106 ) only after every hole has been queried. 

235. Let C(k,A) = (V ^ =0 a 以 ^ja), so that C(0, {1,, m}) = (x 0 i V ... V x 0m ) and 

C(m^ 0) = e. The chain consists of k stages for A: = 1 ， •… ， m, where stage k begins by 
deriving the clauses Xka V C(k — 1, A) from the clauses of stage k _ 1, for all (m — k) - 
element subsets A of { 1 ，• • • ， m}\a; every such clause requires k resolutions with ( 107 ). 
Stage k concludes by deriving C(k, A) for all (m — A:)-element subsets A of {1， ••” m }， 
each using one resolution from ( 106 ) and k — 1 resolutions from the beginning of the 
stage. (See ( 103 )-) Thus stage k involves a total of (k 2 + k) resolutions. 

For example, the resolutions when m = 3 successively yield 1102 03， 12 0103, 
13 0102; 01_02 1112, 0j_03 1113,_02 03 12 13 _(stage 1);_2T02 1112,_2l02 12, 210311 13, 
2T0313, 22 01 1211, 2201 11, 22 031213, 22 0313, 2301 1311, 23 01 11, 23 021312, 
23 0212; 01 112122, 01 1121, 0212 22 23, 0212 22, 03 13 23 22, 03 13 23 (stage 2); and 
3T1121, 3l21, 31, 3212 22, 32 22, 32, 3313 23, 33 23, 33; 32 33, 33, e (stage 3). 
[Stephen A. Cook constructed such chains in 1972 (unpublished).] 


236• The symmetry of the axioms should allow exhaustive verification by computer 
for m = 2， possibly also for m = 3. The construction certainly seems hard to beat- 


Cook conjectured in 1972 that any minimum - length resolution proof would include, for 
every subset S of {1 ， •…, m}, at least one clause C such that 1J ±T keC {k} = S. 

237. The idea is to define yjk = ^/c V (xj m A x m k) for 0 < j < m and 1 < k < thus 
reducing from m pigeons to m — 1- First we append 6 (m — 1) (m — 2) new clauses 


(Xjm V 巧 /c) 八 (^mk ^^jk) ^ (^jm V X m }^ ^ Zj]^) A (Xj/^ ) 八 {jjjk V 巧 /c) 八 （巧 /c V Vjk V ^jk) ^ 

involving 2 (m — 1 ) (m — 2 ) new variables yjk and Zjk* Call these clauses Ajk^ ••” Fj^. 

Now if Pj stands for ( 106 ) and Hij^ for ( 107 ), we want to use resolution to derive 
Pj = (yji V • • • V |/j( m _i)) and H’ ijk = {y ik V y jk )- First, Pj can be resolved with Dji, 
• •” to get Pj V x jm . Next ， P m o H jmm = x m i V • • • V V x jm can be 


resolved with Gjk = Cjk o Ejk = Xj m V x m k V yjk for 1 < A: < m to get Pj V Xj m . One 
more step yields Pj. (The intuitive “meaning” guides these maneuvers.) 

From B jk o F jk = x jk V x mk V y jk , we obtain Q ijk = x ik V y jk after resolving 
with H ij /- and Then 〈Q ij k ◊ Fi/c) ◊ ^-ik = ^im V Vik V yj /- = Rij k ， say. Finally ， 

(Rjik o Hij m ) oRijk = H[j k as desired- (When forming Rjik we need Qjik with j > i.) 

We’ve done 5m 3 — 6 m 2 + 3m resolutions to reduce m to m — 1. Repeating until 


m = 0 , with fresh y and 2 ： variables each time, yields e after about |m 4 steps. 


[See Stephen A. Cook ， SIGACT News 8, 4 (October 1976) ， 28-32.] 

2 

238. The function (1 — cx)~ x = exp (ex 2 + c 2 x 3 /2 + • • •) is increasing and > e cx . 
Setting c = W = \/2nlnr，and b = \W] makes f < r < p_ b • Also W > w(ao) 
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when n > w(ao) 2 and r > 2; hence w(ao he) < IV + b < V 8n In r + 1 as desired. The 
in the lemma handles the trivial cases that arise when r < 2 . 

(it is important to realize that we don’t change n or VF in the induction proof- 
incident ally, the exact minimum of VF + 6 , subject to r = (1 — VF/(2n)) 一 6 , occurs when 


W = 2 n ( l - e - 2 T ( ’=4 M + ^ + 〜， 6=為=(1訂)(-— | — 長-…)， 

where 2： 2 = (lnr)/( 8 n) and T\z) is the tree function. Thus it appears likely that the 
proof of Lemma B supports the stronger result w(ao h e) < VSn In r — | lnr + 1.) 

239. Let ao consist of all 2 n nontautological clauses of length n. The shortest refutation 
is the complete binary tree with these leaves, because every nontautological clause must 
appear. Algorithm A shows that 2 n — 1 resolutions suffice to refute any clauses in n 
variables; hence ||ao 卜 e|| = 2 n — 1 , and this is the worst case. 

240• If A / has t elements and dA f has fewer than the sequence of bt integers fij 
for its neighbors must include at least 2t repeats of values seen earlier. (In fact there 
are at least + 1 repeats, because 2t would leave at least t in the boundary; but the 
calculations are simpler with 2t, and we need only a rather crude bound.) 

The probability pt that some such A 1 exists is therefore less than ( m ^) (g) ( 吾尸， 
because there are (J 71 ^ 1 ) ways to select (^) to select the repeating slots, and at most 
(3t) 2t out of m 2t ways to fill those slots. Also ( m J l_1 ) = (^) + ( t ^ 1 ) < 2 (^) when t < |m. 

By exercise 1.2.6-67 we have pt < 2( 平 )’( 脊 ) 2 ’ ( 崇 = 2{ct!rrif, where c = 
225e 3 /4 ^ 1130. Also po = pi = 0. Thus the sum of pt for t < m/3000 is less than 
2 (c/3000” s .455; and the probability of strong expansion exceeds -544. 

241. If 0 < |A’| < m/3000, we can put one of its elements into a hole bk G dA f . Then 
we can place the other elements in the same way, since bk isn’t their neighbor. 

242. The proof of Theorem B remains valid when these new axioms are added. 

243. (a) The probability that F f has t elements and V(F f ) has fewer than t is at most 

(T) (t)(n) 3t — • The sum of this quantity for 1 < ^ < lg n is 0 (n -1 ), and so is 

the sum for lg n < t < n/(2ae 2 ). 

(b) If the condition in (a) holds ， there’s a matching from F f into V(F f )^ by 
Theorem 7.5.1M; hence we can satisfy F f by assigning to its variables, one by one. If 
F is unsatisfiable we’ll therefore need to invoke more than n/(2ae 2 ) of its axioms. 

(c) The probability pt that F r has t elements and 2\V(F f )\ — 3|_F’| < \\F r \ is at 

most ( 7 ) (ae 1 +A A 3 — A ("n) 1/4 )、;whereA = *. We have (e 1 +A A 3_A ) 4 < 10 6 ; 

so pt < c l when t < W, where c < 1 ， and ^2t =n f/ 2 Pt is exponentially small- 

(d) Since n < n/(2ae 2 )， every refutation a.s. contains a clause C with n /2 < 

fi(C) < n . The minimal axioms F f on which C depends have \F f \ = Let k 

be the number of “boundary” variables that occur in just one axiom of F f . If v is 
such a variable, we can falsify C and the axiom containing while the other axioms 
of F f are true; hence V must contain v or v. We have \V (F 7 )| = k + |nonboundary| < 
k + 1(31^1 — k)^ because each nonboundary variable occurs at least twice. Therefore 
k > 2\V (F 7 )! — 3|F 7 | > n’/4, q.s. (Notice the similarities to the proof of Theorem B.) 

244. We have [AU B}° = [A}°[B}° U [A} 1 ^} 1 and [AU B} 1 = [A] 0 ^} 1 U [^] 1 ^] 0 , 
where concatenation of sets has the obvious meaning. These relations hold also when 
A = 0 or B = 0 ， because [ 0 ]° = {e} and [ 0] 1 = 0 . 
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245. (a) When conditioning on e ⑽， simply delete the edge u —— v from G. When 
conditioning on e uv ^ also complement l{u) and l{v). The graph might become discon¬ 
nected; in that case, there will be exactly two components, one even and one odd, with 
respect to the sums of their labels. The axioms for the even component are satisfiable 
and may be discarded- 


For example, a{G) \ {b^ e} corresponds to while a(G) | { 6 , e} corresponds 

to<>- We toss out the left component in the first case, the right one in the other. 

(b) If (7 G a(v) we may take V / = {t;}- And we have //(e) = \V\^ because the 


axioms UtevAu a ( v ) are satisfiable for all u G V. 

(c) If u ^ V f and v 味 V， , there’s an assignment that falsifies C and some axiom 
of a(u) while satisfying all a(w) for w ^ V f \ because \V f \ is minimum. Setting 
e U v e U v will satisfy a(u) without affecting the axioms a(w) (which don’t contain e U v)- 


(d) By (b), every refutation of a(G) must contain a clause C with < ii{C) < 
|m. The corresponding V' has |y’|/(|F’| + | dV '\) < ( 暑 + 8)/9, hence \dV'\ > 盖 |F’|. 

[Property (i) is interesting but irrelevant for this proof- Notice that a{G) has 
exactly |n s 2.67n 3SAT clauses in n = 3m/2 variables when G is cubic; every 
literal occurs four times- G. Tseytin proved lower bounds for refutations of a{G) by 
regular resolution in 1966, before graphs with property (iii) were known; A. Urquhart 
obtained them for general resolution in JACM 34 (1987), 209—219，and the simplified 
argument given here is due to Ben-Sasson and Wigderson. The fact that a(G) requires 
exponentially long refutation chains，although the same axioms can be refuted easily 
by working with linear equations mod 2 , amounts to a proof that backtracking is a poor 
way to deal with linear equations! Suitable Ramanujan graphs raman(2 ， q,3 ， Q) are part 
of the Stanford GraphBase for infinitely many prime numbers q. We can also obtain 
the same lower bounds with the multigraphs mman ( 2 , g ， 1 ， 0 ) and raman ( 2 , 2 , 0 ). 

Section 7.4.3 will explore expander graphs in detail.] 


246. Let’s write [ai ... ak] 1 for what exercise 244 calls [{ai， ••” a/c}]' With new 
variables z we can introduce {xa^ xb, xab^ ya, yb, yab^ zx^ zy, zxy} and resolve those 
clauses to [zab ] 1 , which means 2 ： = a ® 6. So we can assume that i l a ® & is a 
legal primitive operation of “extended resolution hardware,” when 2 ： is a new variable- 
Furthermore we can compute ai ® • • • ® a/c in O(k) steps，using zo <— 0 (which is the 
clause [zo] 1 ，namely zo) and Zk <— Zk-i ㊉ a/c when A: > 1. 

Let the edge variables E(v) be ai， …， where d is the degree of v. We compute 
s v l ai ㊉ • ••㊉ aj by setting 5^,0 0, s v ,k s v ^k-i ® a/c，and s v We can 

resolve s v with the axioms a{v) in 0{2 d ) steps，to get the singleton clause [〜]…)® 1 , 
meaning s v = £(v). Summing over these operations therefore take O(N) steps. 

On the other hand, we can also compute z n s v and get zero (namely c z n ^). 

Doing this cleverly, by omnisciently knowing we can in fact compute it in 0{mn) 
steps: Start with any vertex v and set 2:1 s v (more precisely, set zi^k s v ，k for 
0 < v < d)- Given Zj for 1 < j < n, with all its sub variables Zj ， k, we then compute 
Zj+i 4 •㊉ where u is the unused vertex with 5^,1 = We can arrange the edges 
into an order so that if Zj has p edge variables in common with s u ^ then = s u ,k for 
1 < k < p. Suppose the other variables of Zj and s u are respectively ai, a q and 
61 , … ， 6 r ; we want to merge them into the sequence ci, …， c q ^- r that will be needed 
later when Zj+i is used. So we set Zj+ 1^0 0, Zj^i^k ^ Zj+i^k-i ㊉ c/c ， Zj+i 々 +i， g + r . 

From the clauses constructed in the previous paragraph 5 resolution can deduce 
[zj.kSu.k] 1 for 1 < k < p, and hence [zj+i.oZj^Su.p] 1 (namely that zj+i^o = 巧 ，: p ㊉ s u ^ p ). 
Furthermore, if Ck = and if we know that = 么心 ㊉ where s = p + i — 1 
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and t = p + k — i, resolution can deduce that = 之 j,s+i ㊉ a similar formula 

applies when c/c = bi. Thus resolution yields 2 ： j+i ~ ㊉ as desired. Ultimately we 

deduce both 2 ： n and 乏 n from the singleton clauses s v = £(v). 

247 . Eliminating X 2 from {12,12,12} gives {!}; eliminating x\ then gives 0. So those 
five clauses are satisfiable. 


248. We have F(xi, …， x n ) = (x n V Ai) A ••• A (x n V A f p ) A {x n V A f {) A ••• A (无 n V A^) A 
A;" 八 ...AA;" = (x n VG ,, )A(x n VG f,, )AG ,/,/ , where G' = A[a^G n = AIa-^AA^ 
and G ,n = A'i A • • • A A^' depend only on {xi ， … ， x n -i}. Hence F' = {G' V G") A G "'; 
and the clauses of G r V G n = 八 f =1 Vare the resolvents eliminating x n . 


249. After learning C 7 = 23 as in the text, we set d ^ 2 ^ h 2 , Cj = 23 , learn 
Cg = 3 , and set d ^ 1^ h ^ 3. Then I 2 ^ 4 (say); and 1 3 ^ U 2. Now C{ = 1234 
has been falsified; after Z 4 卜 2 and Cj = 12 we learn C 9 = 134 , set Z 3 卜 1 , and learn 
C 10 = 134 o 13 = 34 . Finally I 2 4 , we learn Cn = 3; h <— 3 , and we learn C 12 = e. 


250* l\ i — 1 5 I 2 '~ 3 ， Z3 i — 2 ， Z4 i — 4; learn 123^ I 3 i ~ 2 ， Z4 i — 4; learn 123 and 13; 
I 2 i ~ 3 5 Is i — 2, I 4 i — 4* learn 123^ I 3 i ~ 2, I 4 i — 4* learn 123, 13, I5 li i ~ 1 5 I 2 人 ~ 3, 
Is ^~ 4 ， Z4 i — 2; learn 134; Z3 i — 4 5 I 4 . i — 2 5 I 4 i — 2. 

251. Algorithm I has the property that h 15 • • • ， U h _ 1 ^ h k are on the stack whenever 
the new clause h 1 V • • • V Z4 has been learned, if ii <•••<“= d and step 14 returns 
to 12. These literals limit our ability to exploit the new clause; so it appears to be 
impossible to solve this problem without doing more resolutions than Stalmarck did. 

However, we can proceed as follows- Let M[ f mk be the clause x m i V • • • V x m ^-i) V 
Xik V • • • V V Xim^ for 1 < i^k < m. Using ij to stand for Xij, the process for 

m = 3 begins by putting 11 ， 12 ， 13 ， 21 ， 22, 23 ， 31 ， 32， 33 on the stack. Then step 13 
has Ci = I3, step 14 has Cj = M33 ； so step 15 learns I3 o M33 = M32 - Step 14 now 
changes 32 to 32 and chooses Cj = T232 ； so 15 learns M32 o T 232 = M 2 S 2 - Step 14 
changes 31 to 31 and chooses Cj = T231 ； now we learn -^232 o ^23i = -^231 - Next, we 
learn o M 23 = M 22 ； and after changing 22 to 22 we also learn M 21 . 

The stack now contains 11 ， 12, 13 ， 21. We add 31, 32， and proceed to learn 
M32 o T132 = Mf 32 ， M (’ 32 oTi3i = Mf 31 ， M^ 31 o M13 = M12. The stack now contains 
11 ， 12, and we’ve essentially reduced m from 3 to 2. 

In a similar way, 0(m 2 ) resolutions will learn for < = m — 1, … ， 1; and 

they’ll leave xu, • • • ， Xi( m _ 2 ) 5 尤 i(m_i) on the stack so that the process can continue. 


252. No; large numbers of clauses such as xi ‘2 V x ‘23 V • • • V xg 9 V X 19 are generated by 
the elimination process- Although these clauses are valid, they’re not really helpful. 

Exercise 373 proves ， however，that the proof is completed in polynomial time if 
we restrict consideration to the transitivity clauses of exercise 228(!). 

253• A conflict arises when we follow a chain of forced moves: 




2 


4 


L t 

6 

4 



level reason 

1 A 

1 46 

2 A 

2 345 

2 369 


t Lt level reason 

5 7 2 §79 

6 I 2 159 

78 2 678 

8 2 2 123 

9 2 2 258 


Now 258 258ol23 = 1358 13567 35679 3569 356 456; so we learn 456 

(which can be simplified to 56, because 4 is “redundant” as explained in exercise 257)- 
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Setting L2 5, with reason 456 or 56, now forces 7, 1, 3, 9, 2, 8, 8, all at level 1; 
this conflict soon allows us to learn the unit clause 6 - (Next we’ll inaugurate level 0, 
setting Lo <— 6 - No “reasons” need to be given at level 0.) 

254. Deducing 3, 2, 4, 4 at level 1， it will find 24o43 = 23 and 23 o23 = 3, learning 3. 
(Or it might learn 3 after deducing 2.) Then it will deduce 3, 1 ， 2, 4 at level 0. 

255. For example, {124,235,456,456}. [Since the clause c that is learned by the 
procedure described in the text contains just one literal l from the conflict level d ， 
the trail position for l has been called a “unique implication point” (UIP). If l isn’t 
the decision literal for its level，we could resolve c with Vs reason and find another 
UIP; but each new resolution potentially increases the b array and limits the amount 
of backjumping. Therefore we stop at the first UIP.] 

256. If it is false, literals 50, 26, • • •, 30 are true; hence also 25, 23, and 29, a conflict. 
Consequently we can obtain by starting with 23 26 … • 50 and resolving with 23 25 27, 
25 27 29, and 25 30 … 70- [Similarly, and more simply, one can learn ( 122 ) by resolving 
lll6 ...56 with 316191, 4166 91, and 56 6166.] 

257. (a) Suppose V on level d f > 0 is redundant. Then some /〃 in the reason for l f is 
also on level d!] and Z" is either in c or redundant. Use induction on trail position. 

(b) We can assume that the stamp value s used when resolving conflicts is a 
multiple of 3, and that all stamps are < s. Then we can stamp literal l with S(|/|) 

5 + 1 if / is known to be redundant, or s + 2 if Z is known to be nonredundant and not 
in c. (These stamps serve as a “memo cache” to avoid repeated work.) While building 
c we can also stamp levels as well as literals，setting LS \_d!~\ 5 if level d! has exactly 

one of the 6“ or s + 1 if it has more than one- 

Then for 1 < j < r, bj is redundant if and only if LS Uev(bj)^] =5 + 1 and 
red(bj) is true, where lev{l) = VAL(|Z|) 》 1 and where red{l) is the following recursive 
procedure: a If l is a decision literal, return false. Otherwise let (/ V ai V • • • V dk) be Vs 
reason. For 1 < i < k with lev (aj) > 0 ， if S(|a《|) =5 + 2 return false; if S(|a《|) < s 
and either LS Uev (a^)] < 5 or red (d 。 is false, set S(|a《|) 4-5 + 2 and return false. But 
if none of these conditions hold，set S (|/|) ^5 + 1 and return true.” 

[See Allen Van Gelder ， LNCS 5584 (2009) ， 141—146.] 
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258. That statement is true in Table 3, but false in general. Indeed, consider the 
sequel to Table 3: The decision L 44 = 57 causes the watch list of 57 to be examined, 
thus forcing 15, 78, and 87 (among other literals) in some order because of the clauses 
15 57 36， 78 57 36， 87 57 27. Then 96 will be forced by the clause 96 87 •.. 15; and the 
second literal of that clause at the time of forcing will be 15, regardless of trail order ， 
if the watched literals of that clause were 96 and 15 (making it invisible to 78 and 87). 

259. 1 + p 6 + p 7 < p + p 2 when -7245 < p < .7548. (There can in fact be any number 
of crossover points: Consider the polynomial (1 — p — p 2 )(l — p 3 — p 6 )(l — p 9 — p 18 )-) 

260. First，to get a random permutation in the heap we can use a variant of Algo¬ 
rithm 3-4.2P: For A: 1, 2 ， ••” n, let j be a random integer in [0. • A: — 1] and set 
HEAP Ik - 1] — HEAP[j] ， HEAP [j] — A:. Then set HL0C (HEAP [j] ) j for 0 < j < n. 

Next，set F 0 and W/ 0 for 2 < / < 2n + 1 and c ^ 3. Do the following 
for each input clause loh …1 卜 1: Terminate unsuccessfully if A: = 0， or if A: = 1 and 
0 < VAL(|/ 0 |) ^ /o&l- If k = 1 and VAL(|Z 0 |) < 0, set VAL(|/ 0 |) I Zo&l ， TL0C(|Z o |) l 
F ^ F + 1. If A: > 1， set MEM [c + j] lj for 0 < j < A:; also MEM[c — 1] t k, 

MEM [c — 2] Wi 0 ^ 奶 0 1 c，MEM [c — 3] i— c, c c + A: + 3. 
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Finally, set MINL MAXL —• c+2 (allowing two cells for extra data in the preamble 

of the first learned clause). Of course we must also ensure that MEM is large enough. 

261. (Throughout this answer, lj is an abbreviation for MEM [c + j].) Set q ^ 0 and 
c Wf. While c — 0， do the following: Set l f ^ lo. li l r ^ l (hence h = l)^ set 
c ^r- Z— 3 ; otherwise set I’ l h ， lo I’ ， h l I ， c’ l 一 2, 1-2 ^r- l 一 3, and I-3 ^ c . If 
VAL(|/o|) > 0 and VAL(|/o|) + lo is even (that is, if lo is true) 5 perform the steps 

if g — 0, set MEM lq — 3] t c，else set W\ ^r- c; then set q c. (*) 

Otherwise set j 4- 2; while j < l 一 1 and VAL(|/j|) > 0 and VAL(|/j |) + lj is odd，set 

j j+1. If now j < Z—i，set h ^r- lj^ lj ^r- ^r- c. But if j = Z 一 1 ， do (*) 

above; jump to C7 if VAL(|/o|) > 0; otherwise set Lf ^ lo^ etc- (see step C4) and c c . 
Finally，when c = 0, do (*) above to terminate fs new watch list- 

262. To delete k = HEAP [0] in C6: Set h t h — 1 and HL0C(A0 < - 1. Stop if /i = 0. 

Otherwise set i *<— HEAP [/i] ， a ACT (i) ， j 1 0， / 1 1， and do the following while 

j < h: Set a 7 ACT (HEAP [/]); if / + 1 < /i and ACT (HEAP [/ +1]) > a, set j 

/ + 1 and a’ ACT (HEAP [/] ); if a > a 7 5 set f otherwise set HEAP [j] l HEAP [/] ， 
HL0C(HEAP [/]) l j，j l j’，and j’ l 2j + l. Then set HEAP [j] i and HL0C(i) ^ j. 

In C7, set A: |Z|，a ACT(A0 ， ACT(A0 l a + DEL，j l HL0C(A:)，and if 

j > 0 perform the “siftup” operation: “Looping repeatedly, set / 卜 （j — 1) 》 1 and 
i HEAP[j’]，exit if ACT(i) > a, else set HEAP[j] —• i ， HL0C ⑴ l j，j /， and exit 
if j = 0- Then set HEAP [j] k and HL0C(A0 j.” 

To insert A: in C8, set a ACT (AO ， j /i, /i l /i + 1; if j = 0 set HEAP [0] < — k 
and HL0C(A:) <— 0; otherwise perform the siftup operation. 

263. (This answer also sets the level stamps LS Id] needed in answer 257， assuming 

that the LS array is initially zero.) Let “bump r mean “increase ACT(|/|) by DEL” as 
in answer 262. Also let be the following subroutine: “If S(|/|) = s, do nothing. 

Otherwise set S(|/|) 5, p *f- lev (J)• Ifp > 0, bump l] then if p = set q i- q^ 1] else 

set r r + 1 ， 6 r 1 『， d’ 4 - max(d’ ， p)，and if LS [p] < 5 set LS Ip] 5 + [LS Ip] = 5].” 

When step C7 is entered from C4, assuming that d > 0^ set d! 卜 g 卜 r 卜 0, 
s t s + 3 ， S(|/o|) bump Zo，and do blit(lj) for I < j < k. Also set t ^r- 

max(TL0C(|h I) ” • • ， TL0C(|4_i I)). Then, while q > 0, set l l Lt, t ^ t — 1; if 
S(|/|) = s then set g l g — 1， and if i?/ — A let clause Ri be loh - - - lk-i and do blit(lj) 
for 1 < j < k. Finally set / 7 and while S(|r|) ★ s set t <— t — 1 and l f ^ Lt. 

The new clause can now be checked for redundancies as in answer 257 - To install 
it during step C9 ， there’s a subtle point: We must watch a literal that was defined 
on level d!. Thus we set c MAXL, MEM[c] l 『’， A: l 0 ， / l 1; and for 1 < 
j < r if S(|6j|) = 5 set A: *<— A: + 1 and do this: If j 1 = 0 or lev{\bj\) < d\ set 
MEM[c + A: + /] ^r- bj^ otherwise set MEM[c + 1] l l 0 ， MEM [c — 2] 

Wp c, MEM[c—3] ^ Wi. c. Finally set MEM [c —1] A: + l, MAXL ^ c +A: + 6. 

3 3 

264. We can maintain a “history code” array, setting to 0, 2, 4, or 6 when Lf 
is set，and then using H t + (Lt Sz 1) as the move code that represents trail location t 
ior 0 < t < F. History codes 6 ， 4， and 0 are appropriate in steps Cl, C4，and C6, 
respectively; in C9, use code 2 if l r was a decision literal, otherwise use code 6. 

[These move codes do not increase lexicographically when the trail is flushed and 
restarted; hence they don’t reveal progress as nicely as they do in the other algorithms.] 

265. (1) A literal Lt on the trail with G <t < F has become true，but the watch list 
of Lt has not yet been examined- (2) If lo is true, so that c is satisfied，step C4 doesn’t 
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remove c from the watch list of h when h becomes false. (This behavior is justified ， 
because c won’t be examined again until h has become free during the backtracking 
step C 8 -) (3) A clause that becomes a reason for lo remains on the watch list of its 
false h. (4) During a full mn, a clause that triggers a conflict is allowed to keep both 
of its watched literals false. 

In general, a false watched literal must be defined at the highest trail level of all 
literals in its clause. 

266. If U < p, where [/ is a uniform deviate between 0 and 1, do this: Set j to a 
random integer with 0 < j < and k *<— HEAP [j] • If j = 0, or if VAL(A:) > 0, use the 
normal C 6 . Otherwise branch on k (and don’t bother to remove k from the heap). 

267. As in Algorithm L, let there be a sequential table BIMP(D for each literal 
containing all literals l f such that 『 V Z’ is a binary clause. Furthermore, when the 
propagation algorithm sets Lf 卜 " because l f G BIMP(D，we may set 卜 —l, 
instead of using a positive clause number as the “reason •” (Notice that a binary clause 
therefore need not be represented explicitly in MEM, if it is represented implicitly in 
the BIMP tables. The author’s implementation of Algorithm C uses BIMP tables only 
to expedite binary clauses that appear in the original input. This has the advantage 
of simplicity，since the exact amount of necessary space can be allocated permanently 
for each table. Learned binary clauses are comparatively rare in practice; thus they 
can usually be handled satisfactorily with watched literals, instead of by providing the 
elaborate buddy-system scheme that was important in Algorithm L.) 

Here, more precisely, is how the inner loop goes faster with BIMPs. We want to 
carry out binary propagations as soon as possible, because of their speed; hence we 
introduce a breadth - first exploration process analogous to ( 62 ): 

Set H F; take account of l f for all l f G BIMP(/o )； 
while H < set lo <— Rh, H ^ H + 1^ and (**) 

take account of l f for all l f G BIMP(/o)- 

Now “take account of Z’” means “if 广 is true, do nothing; if l f is false, go to C7 

with conflict clause l W l f ; otherwise set Lf ^ ^ TL0C(|"|) l F ， VAL(|"|) ^ 2d + 

("& 1) ， R[f < - Z ，F ^ F + 1.^ We do (**) just before setting c *<— c in answer 261. 

Furthermore, we set E F just after G i- 0 in step Cl and just after F F + 1 in 
steps C 6 and C9; and ifG<E after G <— G + 1 in step C4, we do (* 氺 ） with lo <— L 

Answer 263 is modified in straightforward ways so that “clause is treated as 
if it were the binary clause (/ V V) when Ri has the negative value —l f . 

268. If MEM [c — 1] = A: > 3 is the size of clause c ， and if 1 < j < A:，we can delete 

the literal l in MEM [c + j] by setting k l k — 1, MEM [c — 1] i- l f i- MEM[c +A :]， 

MEM [c + j] l Z’ ， and MEM [c + k~] l Z + /， where / is a flag (typically 2 31 ) that 
distinguishes a deleted literal from a normal one. (This operation does not need to 
be done when the current level d is zero; hence we can assume that k > 3 and j > 1 
before deletion. The flag is necessary so that global operations on the entire set of 
clauses, such as the purging algorithm, can pass safely over deleted literals. The final 
clause in MEM should be followed by 0, an element that’s known to be unflagged.) 

269. (a) If the current clause contains a literal l = Lt that is not in the trivial clause, 
where t is maximum, resolve the current clause with Rj and repeat. 

(b) (wi V bj) A (/j V _ 1 V bj ) for 1 < J < 9, (lo V U 2 V us) A (Iq V fs V 610 )； l' = lo- 

(c) If r > d 7 + r, where r is a positive parameter, learn the trivial clause instead 
of (P V 61 V • • • V 6 r )- (The watched literals should be V and .) 
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Notice that this procedure will learn more than simple backtrack a la Algorithm D 
does, even when the trivial clause is always substituted (that is, even when r = —oo), 
because it provides for backjumping when d! < d + 1. 

270. (a) Consider the clauses 32, 432, 5431, 6541, 654, with initial decisions Li l 1, 
Li •(— 2. Then L 3 3 with reason R 3 ^ 32; similarly La ^ 4, L 5 5. If Lq 4 - 6 , the 
conflict clause 654 allows us to strengthen Rq to 541; but if Lq •<— 6 , with Rq 654, we 
don’t notice that 6541 can be strengthened. In either case we can, however, strengthen 
R 5 to 431, before learning the clause 21. 

(b) After doing blit(lj) to the literals of Ri, we know that Ri \l is contained 
in {bi^... ^b r } together with ^ + 1 unresolved false literals that have been stamped at 
level d. (Exercise 268 ensures that p 7 ^ 0 within each blit.) Thus we can subsume 
clause Ri on the flyif^ + r + l<A: and ^ > 0- 

In such cases the procedure of answer 268 can be used to delete l from c = Ri. But 
there’s a complication, because / = / 0 is a watched literal (j = 0 in that answer), and all 

other literals are false. After l is deleted, it will be essential to watch a false literal l f that 

/ 

is defined at trail level d. So we find the largest f < k such that VAL(MEM[c + j 7 ] ) > 2d, 
and we set Z’ l MEM [c + /] • If j’ — A:, we also set MEM [c + /] l MEM [c + A;] ; we can 
assume that j > 1- Finally, after setting MEM[c] l f and MEM [c + k~] ^ / as in 

answer 268, we also delete c from the watch list W/，and insert it into W//. 

[This enhancement typically saves 1%-10% of the running time，but sometimes 
it saves a lot more. It was discovered in 2009， independently by two different groups 
of researchers: See H. Han and F. Somenzi, LNCS 5584 (2009), 209-222; Y. Hamadi, 
S. J abb our, and L. Sais ， Int. Conf. Tools with Artif. Int. (ICTAI) 21 (2009) ， 328-335-] 

271 • We shall check for discards only if the current clause Ci is not trivial (see exercise 
269), and if the first literal of Ci 一 i does not appear in the trail. (Indeed, experience 
shows that almost every permissible discard falls into this category.) Thus, let ft-i be 
loh … lk-i where VAL(|/o|) < 0; we want to decide if {Z’ ， 6 i ， … , 6 r } C {Zi ， … ， 4-i}. 

The secret is to use the stamp fields that have already been set up. Set j l k — 1, 
q 4 - r + 1, and do the following while q > 0 and j > q: If lj = 『’， or ifVAL(|/j|) < 2c?’+ 1 
and S (I / j I) = s ， set g 卜 g — 1; in any case set j ^ j — 1. Then discard if ^ = 0. 

272. Reflection isn’t as easy to implement as it may seem, unless (7 is a unit clause, 
because C R must be placed carefully in MEM and it must be consistent with the trail- 
Furthermore, experience shows that it’s best not to learn the reflection of every learned 
clause, because excess clauses make unit propagation slower. The author has obtained 
encouraging results ， however，by doing the following operations just before returning 
to C3 in step C9, whenever the length of C doesn’t exceed a given parameter R: 

Assign ranks to the literals of C R by letting rank(/) = oo if Z is on the trail ， 
rank(/) = d n if l is on the trail at level d n < d!, rank(/) = d otherwise. Let u and v 
be two of the highest ranking literals，with rank(?x) > rank(v). Put them into the first 
two positions of C R ^ so that they will be watched. Do nothing further if rank(^) > d!. 
Otherwise，if rank(v) < d!, back jump to level rank(v) and set d! 4 - rank(v). Then if 
rank ( 乜 ) =rank(v) = d\ treat C R as a conflict clause by going to step C7 with c l C R • 
(That is a rare event, but it can happen.) Otherwise, if u doesn’t appear in the current 
trail, set Lf TL0C ( 卜 |) ^ R u <— C R , F F + 1. (Possibly F = E + 2 now-) 

(For example, this method with R ^ 6 roughly halved the running time of 
waerden (3,10; 97) and waerden (3,13; 160) with parameters ( 193 ) except for p i— -995-) 
A similar idea works with the clauses langford(n), and in general whenever the 
input clauses have an automorphism of order 2. 
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273. (a) We can convert Algorithm C into a “clause learning machine” by keeping the 
process going after F reaches n in step C5: Instead of terminating, start over again by 
essentially going back to step Cl, except that the current collection of clauses should 
be retained, and the OVAL polarities should be reset to random bits. Learned clauses of 
size K or less, where K is a parameter, should be written to a file. Stop when you’ve 
found a given number of short clauses, or when you’ve exceeded a given time limit- 

For example, here’s what happened when the author first tried to find W (3,13): 
Applying this algorithm to waerden (3,13; 158) with K = 3^ and with a timeout limit of 
30 G" (gigamems), yielded the five clauses 65 68 70, 68 78 81, 78 8190, 78 79 81, 79 81 82. 
So fifteen clauses 65 68 70, 66 69 71, … , 81 83 84 could be added to waerden (3^ 13; 160 )， 
as well as their fifteen reflections 96 93 91, 95 92 90，…, 80 78 77. Then the algorithm 
“C R ”of exercise 272 proved this augmented set unsatisfiable after an additional 107 Gfi. 
In a second experiment, using K = 2 with waerden (3^ 13; 159) led to three binary 
clauses 76 84, 81 86, and 84 88. Shifting and reflecting gave twelve binary clauses, which 
in company with waerden (3,13; 160) were refuted by C R in another 80 G/x. (For com¬ 
parison, Algorithm C R refuted waerden (3,13; 160) unaided in about 120 G/n, compared 
to about 270 Gfi for both Algorithm C and Algorithm L.) Optimum strategies for learn¬ 
ing useful clauses from satisfiable subproblems are far from clear, especially because 
running times are highly variable. But this method does show promise, especially on 
more difficult problems — when more time can be devoted to the preliminary learning. 

(b) Short clauses that can be learned from satisfiable instances of, say, Xo 
Xi 4 4 when Xo is not required to be an initial state, can be shifted and 

used to help refute Xo —> X\ 4 ••• 4 

274. With care, circular reasoning can (and must) be avoided. But the author’s 
elaborate experiments with such ideas (and with the related notion of “better con¬ 
flicts^ )were disappointing; they didn’t beat the running time of the simpler algorithm. 
However 5 an intriguing idea by Allen Van Gelder [Journal on Satisfiability，Boolean 
Modeling and Computation 8 (2012), 117-122] shows promise. 

275. When a solution has been found, let k be minimum such that x/c = 1 and the 
value of Xk has not been assigned at level 0. If no such k exists, we stop- Otherwise 
we are entitled to force variables x\ through Xk^i all to have their current values, at 
level 0, because we know that this doesn’t produce an unsatisfiable problem. So we fix 
those values, and we restart the solution process at level 1 with the tentative decision 
c Xk = If a conflict occurs, we’ll know that x/c = 1 at level 0; if not, we’ll have a 
solution with Xk = 0. In either case we can increase k. (This method is considerably 
better than that of answer 109, because every learned clause remains valid.) 

276 . True. Unit propagation essentially transforms F A L into F\ L. 

277 . Otherwise F A (7i A • • • A Ct-\ hi e fails (unit propagation wouldn’t start)- 

278. For example, (46, 56, 54, 6,4, e). (Six steps are necessary.) 

279. True, because the dependency digraph contains a literal l with l ― f l ― f l. 

280 . (a) They’re satisfied if and only if xi .. .x n has at least j 0s and at least k Is. 
[The problem cook(k^ k) was introduced by Stephen A. Cook (unpublished) in 1971.] 

(b) Take all positive (j — t)-clauses on {1， ••” n — 1 —for 尤 = 1 ， 2, •••，』•• 

(c) Suppose the very first decision is Lq <— x n - The algorithm will proceed to act 
as if the input were cook (j, k) \ x n = cook k — 1). Furthermore, with these clauses, 
every clause that it learns initially will include x n . Therefore, by induction, the unit 
clause (x n ) will be learned clause number (;:【)• All previously learned clauses are 
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subsumed by this one, hence they’re no longer relevant- The remaining problem is 

cook ( k)\x n = cook (j — 1 , k); so the algorithm will finish after learning more. 

%) 一 

Similarly, if the first decision is Lq ^ x n ^ the ( n :|)th learned clause will be (x n ). 

3 

281. Stalmarck^s refutation corresponds to the sequence Mj k2 ^ •••， Mj k( ^ k _ 1 ^ 

Mj (/ ,_i)) for j = 1， ••” A: — 1， for A: = m，m — 1， • • • ， 1. can be omitted-) 

282. First learn the exclusion clauses ( 17 ) - In the next clauses we shall write aj, bj, …, 

as shorthand for aj， p , bj， p , • • • ， where p is a particular color，1 < p < 3. Notice that 
the 12q edges appear in Aq triangles，namely {bj’Cj , }, { 〜 • ， ay , bjf {/j ， ey ， ， Cj ，}， 

{ej, fj 、 dj，}, for I < j < where j is j + 1 (modulo q). For every such triangle 
{u^ w}^ learn {u v > \! v v \! w p ) and then (u p \! v v \! w p )^ where p f is p + 1 (modulo 3). 

Now for j = 1 ， 2,… ， g，learn (~V/j Vo^/Vey/) ， (ajVejWayW\ (e^V/j Ve^/V/j/), 
{aj V ej V ej/), (dj V J V /，)，(~ V /；• V a^/), as well as eighteen more: 

(u\ V i；i V Uj V Vj)^ (u 2 V V 2 V Uy V v’y), if j > 3 is odd; 

(公 1 V vi V Uj)^ (U 2 V V 2 V Uj/)^ if j > 3 is even; 

here u^v ^ {a, e, /} and u G {a, e, /} yield 3x3 choices of (u^ v ， u 、 v’）• Then we^re 
ready to learn (dj V e^), (dj W fj), (ej V fj) for j e {1,2} and (aj V ej V fj V a y \ 
(aj V ej V fj ) for j G {1, q}. All of these clauses are to be learned for 1 < p < 3. 

Next, for j = g, g - 1, , 2, learn (dj V e^-), (dj V /j), (ej V fj) for 1 < p < 3 

and then (aj-i V ej-i V fj-i V 〜 • ）， (aj —1 V ej-i V fj-i) for 1 < p < 3. We have now 

established all clauses in the hint. 

The endgame consists of the following for 1 < p < 3: For all choices of p and p n 
with {p ， p’ ， p"} = {1 ， 2, 3} (thus two choices), and for j = 2, 3, … ， g，learn three clauses 

{cli^p V 6 i,p / V cij^p V V ei 5 p / V cij^pf V 6 j,p ) 5 (&i，p V 否 V dj^p n V ej ， 〆) ， j even; 

{cli^p V V cij^p V V Gi^pt V o,j^p f V ) ， {cli^p V 否 i，〆 V dj^p n V Cj ), j odd; 

then learn (ai, p V 6 〆）. Finally learn di^ p . 

[Not all of these clauses are actually necessary* For example, the exclusion clauses 
for 6 ’s, c’s, and d’s aren’t used- This certificate doesn’t assume that the symmetry¬ 
breaking unit clauses 6 i，i 八 ci ， 2 八 d\^ of fsnark (q) are present; indeed, those clauses 
don’t help it much. The actual clauses learned by Algorithm C are considerably longer 
and somewhat chaotic (indeed mysterious); it’s hard to see just where an “aha” occurs!] 

283• A related question is to ask whether the expected length of learned clauses is 
0 ( 1 ) as q 4 oo. 

284. It’s convenient to represent each unit clause (/) in F U (7i U • • • U ft as if it were the 
binary clause (/ V xo), where xo is a new variable that is always true. We borrow some 
of the data structures of Algorithm C, namely the trail array L, the reason array 
and the fields TLOC ， S，VAL associated with each variable- We set VAL(AO = 0, 1， or —1 
when Xk has been forced true, forced false, or not forced, respectively. 

To verify the clause ft = (ai V • • • V a/c)，we begin with VAL (j) 0 for 0 < j < n, 

Lq i — 0 ， Z/i i — cii, •••，_£/& i — cik, E i — F i — A; + 1, G i — 0, and VAL( \Lp \) i ~ Lp & 1 for 
0 < p < F] then we carry out unit propagation as in Algorithm C，expecting to reach 
a conflict before G = F. (Otherwise verification fails.) 

A conflict arises when a clause c = lo ... Ik 一 1 forces lo at a time when lo has already 
been forced- Now we mimic step C7 (see exercise 263)，but the operations are much 
simpler: Mark c, stamp S(|/j|) i for 0 < j < A:, and set p max(TL0C(|Zi |)，• • • ， 
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TL0C(|4-iI)). Now, while p > we set l L p ^ p i- p — 1^ and if S(|/|) = i we also 
“resolve with the reason of r as follows: Let clause Ri be loh … h-u mark Ri, and 
set S (\lj\) ^ i for 1 < j < k. 

[Wetzler, Heule，and Hunt have suggested an interesting improvement, which will 
often mark significantly fewer clauses at the expense of a more complicated algorithm: 
Give preference to already-marked clauses when doing the unit propagations, just as 
Algorithm L prefers binary implications to the implications of longer clauses (see ( 62 )).] 

285. (a) j = 77, 577 = 12 + 2827, 17177 — 59, 677 — 710. 

(b) j = 72, 572 = 12 + 2048, m 72 = 99 + 243 + 404 + 536 = 1282, 672 = 
3 + 40 + 57 + 86 = 186 - (The RANGE statistic is rather coarse when a = ^ because 
many different signatures yield the same value.) 

(c) j = 71, 571 = 12 + 3087, inn = 243, 671 — 40. 

286 • The maximum, 738, is achieved uniquely by the RANGE - oriented solution with 
a = 错 ， except that we can optionally include also the signatures ( 6 , 0) and (7,0) for 
which a pq = 0- [This solution optimizes the worst case of clause selection, because the 
stated problem implicitly assumes that the secondary heuristic is bad. If we assume, 
however, that the choice of tie-breakers based on clause activity is at least as good as 
a random choice, then the expected number 738 + 45 •盖 s 745-6 from a = || is not 
as good as the expected number 710 + 287 - 盖 义 750.5 from a = ^.] 

287. When a conflict is detected in step C7 (with d > 0), keep going as in step C3; 
but remember the first clause Cd that detected a conflict at each level d. 

Eventually step C5 will find F = n. That’s when clauses get their RANGE scores ， 
if we’re doing a full run because we want to purge some of them. (Sometimes, however, 
it’s also useful to do a few full runs at the very beginning, or just after a restart, because 
some valuable clauses might be learned.) 

New clauses can be learned in the usual way from the remembered clauses (7 山 in 
decreasing order of except that “trivial” clauses (exercise 269) are considered only at 
the lowest such level We must keep track of the minimum backjump level d\ among all 
of these conflicts. And if several new clauses have the same d’, we must remember all 
of the literals that will be placed at the end of the trail after we eventually jump back. 

288. Step C5 initiates a full run, then eventually finds F = n. At this point we’re 
done, in the unlikely event that no conflicts have arisen. Otherwise we set LS Id] 0 
for 0 < d < n and mj ^ 0 for 1 < j < 256. The activity ACT (c) of each learned 
clause c has been maintained in MEM [c — 5], as a 32-bit floating point number. The 
following steps compute RANGE (c), which will be stored in MEM [c — 4] as an integer, for 
all learned c in increasing order, assuming that c’s literals are loh … l s -i: 

If Ri 0 = c, set RANGE (c) 0. Otherwise set p r l 0 ， and do the following 

for 0 < A: < 5 ： If i; < 2 and v + Ik is even, set RANGE (c) — 256 and exit the loop on k 
(because c is permanently satisfied，hence useless). If i; > 2 and LS[Zew(4)] < c，set 
LS Uev (4)] c and r r+1- Then if v 2 2 and LS Uew (4)] = c and 4 + v is even, set 
LS [lev (/fc)] c + 1 and p *<— p+ 1. After k reaches 5 , set r min(Ll 6 (p + a(r — p ))」， 
255), RANGE (c) r, and m r ^ m r + 1- 

Now resolve conflicts (see answer 287)，giving ACT(c) l 0 and RANGE (c) l 0 to 
all newly learned clauses c, and jump back to trail level 0. (A round of purging is a 
major event, something like spring cleaning. It is possible that 义 = 0, in which case 
one or more literals have been appended to trail level 0 and their consequences have not 
yet been explored.) Find the median range j as defined in ( 124 )，where T is half the 
total current number of learned clauses* If j < 256 and T > Sj, find h = T — Sj clauses 
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with RANGE (c) = j and ACT (c) as small as possible，and bump their range up to j + 1- 
(This can be done by putting the first rrij — h of them into a heap, then repeatedly 
bumping the least active as the remaining h are encountered; see exercise 6 . 1 - 22 .) 

Finally, go again through all the learned clauses c, in order of increasing c, ignor¬ 
ing c if RANGE (c) > otherwise copying it into a new location c < c. (Permanently 
false literals, which are currently defined at level 0 , can also be removed at this time; 
thus the clause’s size in MEM[c — 1] might be less than MEM [c — 1]. It is possible, but 
unlikely, that a learned clause becomes reduced to a unit in this way, or even that it 
becomes empty.) The activity score in MEM [c — 5] should be copied into MEM[c — 5]; 
but RANGE (c) and the watch links in MEM [c — 2] and MEM [c — 3] needn’t be copied. 

When copying is complete, all the watch lists should be recomputed from scratch 5 
as in answer 260, including original clauses as well as the learned clauses that remain. 

289. By induction, y k = (2 - 2 1 - fc )A + (2(k - 2) + 2 2 ~ k )S for all k > 0. 

290. Set k HEAP [0]; then if VAL(A:) > 0, delete k from the heap as in answer 262, 
and repeat this loop. 

291. OVAL (49) will be the even number 36， because of the propagations on level 18 
that led to ( 115 ). 

292. If AGILITY > 2 32 — 2 13 ，then ( 127 ) either subtracts 2 19 — 1 or adds 1- Hence 
there’s a minuscule chance that AGILITY will overflow by passing from 2 32 — 1 to 2 32 
(zero). (But overflow won’t be a calamity even if —— unbelievably —— it happens- So this 
is one “bug” in the author’s program that he will not try to fix.) 

293. Maintain integers Vf, and where Of has 64 bits. Initially U{ = Vf = Mf = 1. 

When M > Mf is detected in step C5, do this: If set U{ + 1, V{ 1, 

0{ 2 32 ^; otherwise set V{ 2v{ and — • 沒 f + ( 如》 4). Flush if AGILITY < Of. 

294. We have, for example , 夕 1100 = f (^oioo+^iooo+^mo ) 5 and 々 01*1 = 1. The solution 
is poo*i = poi*o = 9 ii*i = ^4/goo*o = 910*1 = 911*0 = B 丨 ID ， 910*0 = CjD ， where 
A = 3z — z 2 — z 3 ^ B = z 2 ^ C = z 3 , D = 9 — 6z — 3z 2 + 2 ： 3 - Hence the overall generating 
function is g = ( 6 A + 6J5 + 2(7 + 2D)/(16D); and we find g’(l) = 33/4 ， g”(1) = 147. 
Thus mean(^) = 8.25, var(^) = 87.1875, and the standard deviation is s 9-3- 

295. Consider all 3Q) clauses Xi V Xj V Xk for distinct {i^j^ k}^ plus two additional 
clauses (x\ V X 2 V ^ 3 ) A (xa V ^5 V xq) to make the solution 0 • • • 0 unique. Only the two 
latter clauses cause the variables Xt and Yt in the proof of Theorem U to deviate from 
each other. [C. Papadimitriou, Computational Complexity (1994), Problem 11-5-6. 
These clauses spell trouble for a lot of other SAT algorithms too.] 

296. The hinted ratio 2(2p + g+l)(2p + g)/(9(j? + l)(p + g+l)) is ^ 1 when p ^ q (more 

precisely when p = q — 7 0(l/q)). And f(q + l,g + l)/f(q,q) = 2(n - q)(3q + 3)-/ 

(27(^+1) 2 (2^+2)-) is ^ 1 when q ^ n/3. Finally ， /(n/3 ， n/3) = j^ {3/A) n (l+O(1/n)) 
by Stirling’s approximation, when n = 3q. 

297. (a) G q (z) = (z/3) q C(2z 2 /9) q = G(z) q where G(z) = (3 - V9 - 8z 2 )/(4z), by 
Eqs. 7.2.1. 6 -( 18 ) and ( 24 ). [See Algorithmica 32 (2002), 620-622.] 

(b) G q (l) = is the probability that Yt actually reaches 0, for some finite t. 

(c) If the Y process does stop ， G q {z)/G q (l) = {2G{z)) q describes the distribution 
of stopping times. Hence G r q (l)/G q {l) = 2qG r {!) = 3^ is the mean length of the random 
walk, given that it terminates. (The variance, incidentally, is 2Aq. A random Y -walker 
who doesn’t finish quickly is probably doomed to wander forever.) 

(d) The generating function for T, the stopping time of the Y process, is T\z )= 
{ n q )2^ n G q {z)\ and T is finite with probability T(l) = (|) n by (b). If we restrict 
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consideration to such scenarios, the mean T / (1)/T(l) is n; and Markov’s inequality 
tells us that Pr(T > N) < n/N. 

(e) The algorithm succeeds with probability p > Pr(T < N) > (1 — n/iV)(3/4 ) n ， 
when it is given satisfiable clauses. So it fails after K(4/3) n trials with probability less 
than exp(i^(4/3) n ln(l — p)) < exp(—i^(4/3) n _p) < exp(—i^/2) when N = 2n. 

298. Change 1/3 and 2/3 in ( 129 ) to 1/A: and (k — l)/k. The effect is to change G(z) 
to (z/k)C((k - l)z 2 /k 2 ), with G(l) = 1/0 — 1) and G'(l) = k/((k - l)(k-2)). As 
before, T(l) = 2— n (l + G(l)) n and : T’ （ 1)^T(1) = nG'{!)/{l + ^(1)). So the general¬ 
ized Corollary W gives success probability > 1 — e~ K ^ 2 when we apply Algorithm P 

K(2 - 2/k) n times with N = [2n/(k - 2)J . 

299. In this case G(z) = (1 — yjl — z 2 )/z\ thus (^(1) = T(l) = 1. But G"’(l) = oo, so 
we must use a different method. The probability of failure if iV = n 2 is 
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[See C- Papadimitriou ，Computational Complexity (1994)，Theorem 11.1 」 


300. In this algorithm, variables named with uppercase letters (except C and N) de¬ 
note bit vectors of some fixed size (say 64); each bit position represents a separate trial. 
The notation U r stands for a vector of random bits，each of which is 1 with probability 
1/r, independently of all other bits and all previous IPs. The maximum number of 
flips per bit position in this variant of Algorithm P is only approximately equal to N. 


PI’. [Initialize.] Set Xi ^ U 2 for 1 < i < n. Also set t 卜 （)• 

P2 7 . [Begin pass.] Set Z<—0 and j<—0. (Flipped positions are remembered in Z.) 

P3 7 . [Move to next clause.] If j = m, go to P5’ • Otherwise set j <— j + 1- 

P4 7 . [Flip.] Let Cj be the clause (/1 V • • • V 4)- Set Y Li & • • • & where 
Li denotes Xh if U = xh and Li denotes Xh if U = Xh. (Thus Y has Is in 
positions that violate clause Cj.) Set Z ^ Z \ Y and t ^ t + {Y^cl). Then 
for r = A:, A: - 1, , 2 set 疒卜 y & L r 卜 i 

Finally set L\ l la ㊉ Y and return to P3’. 


P5 7 . [Done?] If Z ^ —1, terminate successfully: One solution is given by the 
bits (Xi & B)... (X n & B), where B = Z h (Z 1). Otherwise, if t > N, 
terminate unsuccessfully. Otherwise return to P2 f . | 


The shenanigans in step P4’ have the effect of flipping the offending bits of each literal 
with probability 1/A:, thus distributing the Is of Y in an unbiased fashion. 


301. In practice we can assume that all clauses have limited size，so that (say) k < A 
in step P4’ • The clauses can also be sorted by size. 

A traditional random number generator produces bits U 2 ] and one can use U 2 KJ 2 
to get Ua. The method of exercise 3.4.1-25 can be used for other cases; for example, 


U 2 & (U 2 I (U 2 & (U 2 I (U2 & (U 2 I (U2 & (U 2 I (u 2 & t/ 2 )))))))) 
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is a sufficiently close approximation to lh. The random numbers needed in step PI’ 
must be of top quality; but those used in step P4’ don’t have to be especially accurate, 
because most of their bits are irrelevant- We can precompute the latter, making tables 
of 2 d values for each of lh, lh, Ua, and running through them cyclically by means of 
table indices U2P ， U3P ， U4P as in the code below，where UMASK = 2 朴 3 — 1. The values of 
U2P ， U3P，and U4P should be initialized to (truly) random bits whenever step P2’ starts 
a new pass over the clauses- 

Here is sample code for the inner loop, step P4’，for clauses with k = 3. The 
octabyte in memory location L + 8 (i—1) is the address in memory where is stored ， 
plus 1 if it should be complemented; for example, if h is ^ 3 , the address X + 3 x 8 + 1 will 
be in location L + 8 , where L is a global register. Register mone holds the constant —1- 


LD0U 

$1 ， L，0 

addr(Li) 

X0R 

$9 ， $6，$0 

L 3 

ST0U $ 6 ,$3,0 

Ls 

€ 


LD0U 

$4,$1,0 

1^11 

AND 

$7,$7 ,$8 


SUBU $7 ， $7，$0 




LD0U 

$2,L,8 

addr (L‘ 2 ) 

AND 

$7,$7,$9 

Y 

LDOU $0 ， U2 ， U2P 




LD0U 

$5, $2 , 0 


OR 

Z ， Z，$7 

Z 1 Y 

ADD 

U2P ， U2P，8 




LD0U 

$3 ， L，16 

addr(I/ 3 ) 

AND 

$0,$7,1 

Ykl 

AND 

U2P ， U2P，UMASK 



LD0U 

$ 6 ,$3,0 


ADD 

T ， T，$0 

new t 

AND 

$0 5 $0 j $7 < 

U 2 kY 

ZSEV 

$ 0 ,$ 1 ，mone 

LD0U 

$0 ， U3 ， U3P 


X0R 

$5,$5 ,$0 




X0R 

$7 ， $4，$0 

u 

ADD 

U3P ， U3P，8 


ST0U $5,$2,0 

L 2 ^ 


ZSEV 

$ 0 , $ 2 ，mone 

AND 

U3P ， U3P，UMASK 

SUBU 

$7,$7,$0 




X0R 

$ 8 ， $5 ， $0 

L 2 

AND 

$0 5 $0 5 $7 

Us^Y 

XOR 

$4 ， $4，$7 




ZSEV 

$ 0 ,$3，mone 

X0R 

$ 6 ， $ 6，$0 


ST0U $4,$1,0 

U € 

1 


302. Assume that literals are represented internally as in Algorithm A, and that all 
clauses have strictly distinct literals. An efficient implementation actually requires more 
arrays than are stated in the text: We need to know exactly which clauses contain any 
given literal, just as we need to know the literals of any given clause. And we also need 
a (small) array bo … bk-i to list the best candidate literals in step W4: 

W4. [Choose /•] Set c 卜 oo, j 卜 0, and do the following while j < k: Set j 卜 j + 
1 , l lj ； and if c\i\ < c, set c C|/|, 6o t Z, s —• 1; or if c\i\ = c, set 4— Z, 
5 5 + 1. Then if c = 0, or if c > 1 and U > set l <- 則； otherwise 

set l < — (Each random fraction U is independent of the others.) 

W5. [Flip L] Set 5 ^- 0 . For each j such that Cj contains make clause Cj hap¬ 
pier as follows: Set q t kj, kj I g + 1; and if g = 0, set s s + 1 and delete 
Cj from the / array (see below); or if ^ = 1, decrease the cost of Cj’s critical 
variable (see below)- Then set c … <—s and x\i\ <— x\i\. For each j such that 
Cj contains ^ make clause Cj sadder as follows: Set q t kj — 1, kj t q' and 
if g = 0, insert Cj into the f array (see below) ; or if ^ = 1, increase the cost 
of Cj’s critical variable (see below). Set t t + 1 and return to W2. | 

To insert Cj into /, we set / r 卜 j, Wj r, and r 卜 r + 1 (as in step Wl)- To 
delete it, we set h ^ Wj^ r ^ r — 1^ fh ^ fr^ Wf r h. 

Whenever we want to update the cost of Cj’s critical variable in step W5, we know 
that Cj has exactly one true literal- Thus，if the literals of Cj appear sequentially 
in a master array M ， it’s easy to locate the critical variable We simply set 

i <— START (j); then while Mi is false (namely while x\m { \ = Mi &1), set i i + 1- 

A slight refinement is advantageous when we will be increasing c\m { \ ： If « 7 ^ 
START(j)，swap M START ( J ) — > Mi. This change significantly shortens the search when 
c\m { \ is subsequently decreased. (In fact，it reduced the total running time by more 
than 5% in the author’s experiments with random 3SAT problems.) 
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303. In this case D = 3 — z — z 2 = Ajz, and we have 分 ’(1) = 3， = 73/4. Thus 
mean(^) = 3 and vax(g) = 12.25 = 3.5 2 - 


304. li vx = xi + - ^ + x n = a, there are a(n — a) unsatisfied clauses; hence there are 
two solutions，0 . •. 0 and 1. •. 1. li x\ ... x n isn^t a solution, Algorithm P will change 
a to a 士 1， each with probability Thus the probability generating function g a for 
future flips is 1 when a = 0 or a = n，otherwise it is z(g a -i + p a +i)/2. And the overall 
generating function is g = Y. a 0/2' Clearly g a = g n 一 a . 

Exercise MPR-105 determines g a and proves that the mean number of flips, ‘⑴, 
is a(n - a) for 0 < a < n. Thus = 2— n J^a=o = I ( 2 ) - 

Turning now to Algorithm W 5 again with + • • • + = a, the cost of is 

a — 1 when Xi = 1^ n — a — 1 when Xi = 0. Therefore gi = g n — i = z in this case. 
And for 2 < a < n — 2, we will move closer to a solution with probability q and 
farther from a solution with probability p, where p + q = 1 and p = p/2 < 1/2; here 
p is the greed-avoidance parameter of Algorithm W. Thus for 2 < a < n/2 we have 

9a = 9n — a = z {^19a—l ~f" P9a+l\ 

If p = 0, so that the walk is 100% greedy. Algorithm W zooms in on the solution, 
with g a = z a . Exercise 1.2.6—68 with p = 1/2 tells us that = n/2 — m (二 )/2 n = 

n/2 — ^/n/27r + 0(1) in that case. On the other hand if p = 1 5 so that the walk is 
greedy only when a = lora = n — 1, we’re almost in the situation of Algorithm P but 
with n decreased by 2. Then "’⑴ = 2 _n C)(l + (a — l)(n — 2) — (a — l) 2 )= 

n(n — 5)/4 + 2 + (2n — 4)/2 n ; greed triumphs. 

What happens as p rises from 0 to 1? Let’s decrease n by 2 and use the rule g a = 
z{qg a -i + P9a+i) for 1 < a < n/2, so that the calculations are similar to those we did 
for Algorithm P but with p now < 1/2 instead of p = 1/2. Functions t m and u m can be 
defined as above; but g a = (qz) a t m -a/t m ^ the new recurrence is t m +i = t m —pqz 2 t m 一 
and to = 1/^, uo = l/(qz). These functions are polynomials in p, and 2 ：, whose coeffi¬ 


cients are binomial coefficients: In the notation of exercise 1.2.9—15, for m > 0 we have 
tm = Gm - 1 ( -pqz 2 ) - pz 2 Gm - 2 ( -pqz 2 ) and u m = G m ^i(-pqz 2 ) -pzG m ^ 2 (-pqz 2 ), so 


T{w)= 


1 — pw 

q(l — w + pqz 2 w 2 ) 


U{w )= 


1 — (1 — qz)w 
qz{l — w + pqz 2 w 2 ) 


Consequently C(l)/t m (l) = 2pq(l- (p/q) m )/(q-p ) 2 - 2pm/(q-p) and 从。⑴/从爪 （1)= 
(2p- (p/q) m q)/(q~p) 2 -2p(m-^)/(q-p); g' a {l) = a/(q-p)-2pq((q/p) a -l)/(q-p) 2 
for 0 < a < n/2 when n is even, a/(q — p) — q((q/p) a — l)/(g — p) 2 when n is odd. The 
overall totals when n = 1000 and p = (.001，.01，.1，.5, .9，.99，-999) are respectively 
« (487.9, 492.3, 541.4, 973.7, 4853.4, 44688.2, 183063.4). 

305. That little additional clause reverses the picture! Now there’s only one solution, 
and greediness fails badly when vx > n/2 because it keeps trying to move x away 
from the solution. To analyze the new situation in detail, we need 3(n — 1) generating 
functions g a b^ where a = x\ + X 2 and 6 = a ：3 + • • • + x n . The expected number of flips 

will be ^(1), where g = 2~ n ELo E^o O CVW. 

The behavior of Algorithm P is ambiguous, because the unsatisfied clause found 
in step P2 depends on the clause ordering. The most favorable case arises when a = 2, 
because we can decrease a to 1 by working on the special clause Xi\/X 2 - Any other clause 
is equally likely to increase or decrease a + 6. So the best-case generating functions 
maximize the chance of reaching a = 2: 如 0 = 1， g 01 = f (如 o + pii)， go 2 = f ( 如 i + 仍 2 )， 
^10 = f (如 0 + 々 20 )， gii = f (分 10 + 仍 1 )， gi ‘2 = f (分 11 +分 22 )， and Q 2 b = zgib. The solution 

has gib = {z/{2 — z 2 )) b+1 ] and we find mean(p) = 183/32 = 5-71875. 
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The worst case arises whenever g 2 o ^ 2:^10 and Q 2 i 7 ^ 之分 11 ; for example we can 
take Q 20 = f (P 10 +P 21 )，921 = 奮 ( 920 + 922 )，together with the other seven equations from 
the best case. Then 分 01 = gio = z(4 — 3z 2 )/d^ go 2 = gn = 夕 20 = z 2 (2 — z 2 )/d^ and gvi — 
g ‘21 = z z jd, where d = 8 — 8 z 2 -hz 4 . Overall, g = {1+z ) 2 (2 — z 2 )/{Ad) and mean ( 分） = 11 . 

(This analysis can be extended to larger n: The worst case turns out to have g a b = 
g a +b = {z/2) a + h t n - a -b/tn^ in the notation of the previous exercise 5 giving n(3n — 1)/4 
flips on average. The best case has gib as before; hence g 0h = 36+2 —2 1 — 6 , g lb = 36 + 3, 
and g 2 b = 36 + 4 when z = 1. The best average number of flips therefore turns out to 
be linear, with mean(p) = |n — |(3/4) n .) 

The analysis becomes more exciting, but trickier, when we use Algorithm W- Let 
p = p/2 and g = 1 — p as in the previous answer. Clearly goo = 1, 分 01 = ^io = 么分 00 , 
go 2 = f (P 01 + 分 12 )，and ^22 = zgvi\ but the other four cases need some thought- We have 

911 = z (( 去 +g)(Poi+P10) + 夕 12 + 2 洲 21 )， 

since the costs for 尤 1 尤 2 $ 3 尤 4 = 1010 are 1211 and the unsatisfied clauses are (xiW X 4 )， 
( 无 3 V xa)^ ( 无 iVa> 2 ), ( 无 3 Va> 2 ); in the former two clauses we flip each literal equally often ， 
but in the latter two we flip X 2 with probability p and the other with probability q. 
A similar but simpler analysis shows that 分 21 = f (^n + 3 ^ 22 ) and g ‘20 = f (3^io + 2 ^ 21 )- 

The most interesting case is gvi = §(^02 + ^pgn + 3 收 22 )，where the costs are 
2122 and the problematic clauses are ( 无 iVa> 2 ) ， ( 无 3 Va> 2 ), (X 4 VX 2 ) - Ifp = 0, Algorithm W 
will always decide to flip X 2 ] but then we’ll be back in state 12 after the next flip- 

Indeed，setting p = 0 yields 如 0 = 1 ， g 01 = gio = z, g 0 2 = gn = \z 2 ^ 

g 2 o = , g 2 i = jqZ , and gvi = 922 = 0- The weighted total therefore turns out 

to be ^ = (40 + 1602 ： + 164 之 2 + 15 之 3 + 3 之 4 )/640. Notice that the greedy random walk 
never succeeds after making more than 4 flips, in this case; so we should set TV = 4 and 
restart after each failure- The probability of success is ^(1) = 191/320- (This strategy 
is actually quite good: It succeeds after making an average of 1577/382 ^ 4.13 flips 
and choosing random starting values xix^x^xa about 320/191 times.) 

If p is positive，no matter how tiny，the success probability for iV = 00 is ^(1) = 1- 
But the denominator of ^ is 48 — 48z 2 + 26pz 2 + 6 ^ 2： 4 — 17p 2 z 4 ^ and we find that 
mean(p) = (1548+2399p-255p 2 )/(1280p-680p 2 ) = (6192+4798^ -255p ,2 ) /(2560^ - 
680p /2 )- Taking p = (.001, .01 ， .1 ， .5 ， .9 ， .99, .999) in this formula gives, respectively, 
the approximate values (2421.3, 244.4, 26.8, 7.7, 5.9, 5.7, 5.7). 

(Calculations for n = 12 show that ^ is a polynomial of degree 8 when p = 0, 
with ^(1) ^ .51 and g (l) ^ 2.40. Thus，setting TV = 8 yields success after about 
16.1 flips and 1.95 initializations. When p > 0 we have g\l) » 1.635p 一 0 + 0(p~ 4 ) as 
p —0, and the seven values of p considered above yield respectively (5 x 10 16 , 5 x 10 11 , 
5 x 10 6 , 1034.3, 91.1, 83.89, 83-95) flips —— surprisingly not monotone decreasing in p . 
These Walks AT statistics can be compared with 17-97 to 105 flips for Algorithm R) 

306. (a) Since l(N) = En + (1 — qn)(N we have qNl(N) = En N — Nq 2 \ = 

pi + 2 p 2 H - h NpN + NpN+i H - + Npoc = TV — (gi H - h qN-i). 

(b) If = m + A: and A: > 0 we have En = m?/n, gi + • • • + qN-i = km In, and 

qN = m/n; hence l(N) = n + k(n — m)/m. 

(c) If iV<n, l(N) = (iV-( 2 )/n)/(iV/n) = n - otherwise l(N)=l(n) = 

(d) From qN = pi(N — qi —… 一 q N _i) and qN+i = 仍 （TV + 1 — 奶一 ••—— qN) 
we deduce pn+i = pi(1 — ^iv), hence 1 — Qiv+i = (1 — pi)(l — Qn)^ So it’s a geometric 
distribution, with pt = p(l — p) t ~ 1 for t > 1. (The fact that 1(1) = 1(2) = • • • is called 
the “memoryless property” of the geometric distribution.) 
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(e) Choose pi, …, p n arbitrarily, with q n = pi + • • • + < 1- Then, arguing as 

in (d) ， Pn+i ， j?n+ 2 , ••• are defined by 1 — ^at = (l — l//(n)) A '~ n (l — q n ) for N > n. 

(f) Since /(n + 1) —l(n) = (n_ (gi + • • • +g n ))(l — l/^ n ) < 0, we must have q n = 1 
and l{n) = l{n + 1)- (The case l{n) < l{n + 1) is impossible.) 

(g) Let x = pi and y = P 2 - By part (f), the conditions are equivalent to 
0 < x < x+y < 1 and x(3 — 2x — y) > 1. Hence 0 < {2x — l){l—x) —xy < (2x —1)(1 —x)\ 
we get the general solution by first choosing | < x < 1 ， then 0 < y < ( 2 x — 1)(1 — x)/x. 

(h) If TV* = ex) and l{n) < oo, we can find n with q u d{n) = pi+ 2 p 2 + - - -+n 7 p n / + 
n’p n ，+i + • • • + n poo > l{n). Hence /(TV) > qNl{N) > q u d{n) > l{n) for all N > n . 

(i) We have q n +k = k/{k + 1) for k > 0; hence l(n + k) = (k + l)(n + Hk)/k. 
The minimum occurs when l{n + k) ^ Z(n + A: — 1)，namely when n ^ k — thus 
A: = n + In n + 0(1). For example, the optimum cutoff value when n = 10 is 1ST = 23. 
(Notice that Eoo = oo, yet l = l(N^) » 14.194 in this case.) 

(j) Let pt = [t> 1}/2^\ Then l(N) = (3 - 2 2 ~ N )/(1 - 2 1 ~ N ) decreases to 3. 

(k) Clearly l < L. For N < L we have l(N) = (N — (gi + ... + qN-i))/qN > 
(AT-(1+ • • • -^(N-1))/L)/(N/L)~= L-(N-l)/2 > (L-f-l)/2. And for N = [L\+k^l, 
similarly, l(N) > TV - (1+• • • + [LJ +kL)/L = [_L + 1」 （1 - \_L\/(2L)) > (L+ 1)/2. 

307. (a) EX = En 1 + (1 — qN^^Ni + EX’)，where X r is the number of steps for the 
sequence (# 2 , ％， For numerical results，start with j l 0， s l 0， a 1; then ， 
while a > e，set j<—j + l^a<— (1 — qNj )a, and s <— s + En 5 + aNj. (Here e is tiny.) 

(b) Let Pj = (1 — qN 1 ) • • • (1 — QN j _ 1 ) = Pr(X > Tj)，and note that Pj < 
(1 — Pn) J ~ where n = min{t | pt > 0}. Since qNl(N) = En + (1 — Qn)N^ we have 

E-X = qN x l(Ni ) + (1 — qNi ) {qn 2 K^' 2 ) + (1 _ Qn 2 ){qn 3 1 (Ns) + ...)) 

OO OO 

= ^PjQNjljNj) = y^(Pj - Pj + i)l(Nj). 

(c) EX> Y ： T=i( P J - Pj+i)KN*) = L 

(d) We can assume that Nj < n for all j; otherwise the strategy would do even 

worse. For the hint，let {iVi ， … ， 7V r } contain r m occurrences of for 1 < m < n, 
and suppose t m = r m + — •+ r n . If t m < n/ (2m), the probability of failure would 
be (1 — m/n) tm > 1 — t m m/n > 1/2. Hence we have t m > n/(2m) for all m，and 
iVi + ... + iV r = ti t n > nH n /2. 

Now there’s some m such that the first r — 1 trials fail on p( m ) with probability 
> I• For this m we have EX > 臺 (iVi + • • • + N r -i) > |(iVi + … + iV r — n)• 

308. (a) 2 a+1 — 1; and we also have 5^+6 = 洗 +i for 0 < 6 < 2 a — 1 (by induction). 

(b) The sequence (u n ^v n ) in ( 131 ) has 1 + pk entries with u n = hr, and pi + 
• • • + pn = n — vn by Eq. 7.1.3—( 61 ) • From the double generating function g(w, z)= 
Sn>o wUnz，n — ( 1 +^^)( 1 +^^ 2 )( 1 +^^ 4 )( 1 +^^ 8 ) ... we deduce that ^ 2 k>0 z 2k+1 ~~ uk = 
zg(z~\z 2 ). 

(c) {n I = 2 a } = {2 a+ 1 k-^ 2 a+1 -l-j,k\k>0}; hence En>o z n [S n =2 a } = 
z 2a+1 - 1 g(z~\z 2a+1 ) = ，十 1 ， +， +1 ，(l + ，〜吣 +， +3 - T ).... 

(d) When 2 a occurs for the 2 6 th time，weVe had 2 a+b ~ c — [c > a] occurrences 
of 2 C , for 0 < c < a + 6 . Consequently S(a, 6 , 1) = (a + 6 — l)2 a + 6 + 2 a+1 . 

(e) The exact value is ^2^1 q 2 a+b ~ c 2 c + 2 a+6+c ; and pk < \k = [lg k\. 

(f) The stated formula is Emin/ c {S(a, 6 , A:) | D(a, 6 , k) > X}^ if we penalize the 
algorithm so that it never succeeds unless it is run with the particular cutoff N = 2 a . 
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(g) We have Q < (1 — qt) 2b < (1 — qt) l ^ qt < e _1 ; hence EX < (a + 6 — l)2 a+6 + 
2 a+1 ^Y^=i^^^-^ a+h e~ k = ^ a+h {^ + b)e/{e-l) + e{?>-e)/{e-lf^2 1 - h ). 
Furthermore we have 2 a+b < 81 — 4 / [6 = 0], by exercise 306(k). 

309. No —— far from it. If Algorithm C were to satisfy the hypotheses of exercise 306, it 
would have to do complete restarts: It would not only have to flush every literal from 
the trail, it would also have to forget all the clauses that it has learned, and reinitialize 
the random heap. [But reluctant doubling appears to work well also outside of Vegas.] 

310 . A method analogous to ( 131 ) can be used: Let = (1,0); then define 

(^ 4 . 1 , ^n+i) = ( u n ^ —u’ n = 1 《 v’ n ? (succ(?x / n ), 0): (u n , v n + 1)). Here C succ^ is the 
Fibonacci-code successor function that is defined by six bitwise operations in answer 
7-1.3-158. Finally, let S n — F v f + 2 for n > 1. (This sequence {S r n ), like (S n )^ is “nicely 
balanced ”； hence it is universal as in exercise 308. For example, when F a appears 
for the first time，there have been exactly F a + 2 -c occurrences of for 2 < c < a.) 

311. Because (R n ) does surprisingly well in these tests，it seems desirable to consider 
also its Fibonacci analog: If f n = succ(/ n _i) is the binary Fibonacci code for n，we 
can call (pn) = (pf n ) = (0 ， 1 ， 2,0,3,0 ， 1 ， 4,0,…） the “Fibonacci ruler function，” and 
let (R f n ) = (1 ， 2, 3,1 ， 5,1 ， 2, 8 ,1， •••）be the “ruler of Fibonacds，” where R f n = F 2+p ， n . 

The results (Es, E s ，， Er, E r >) for m = 1 and m = 2 are respectively (315.1, 357.8, 
405.8, 502.5) and (322.8, 284.1 ， 404.9, 390.0); thus S beats 5’ beats R beats R r when 
m = 1, while S r beats S beats R f beats R when m = 2. The situation is, however, 
reversed for larger values of m: R beats R f beats S beats S f when m = 90, while R f 
beats R beats S f beats S when m = 89. 

In general, the reluctant methods shine for small m, while the more “aggressive” 
ruler methods forge ahead as m grows: When n = 100, S beats R if and only if m < 13, 
and S f beats R r if and only if m < 12. The doubling methods are best when m is a power 
of 2 or slightly less; the Fibonacci methods are best when m is a Fibonacci number or 
slightly less- The worst cases occur at m = 65 = 2 6 + 1 for 5 and R (namely 1402.2 
and 845-0); they occur at m = 90 = Fn + 1 for S f and R f (namely 1884.8 and 805.9). 

312. T(m, n) = m -\- b2 b ho(0)/0 + 2 b g(0)^ where b = [lg m], 0 = 1 — m/n, h a (z )= 
En z n [S n =2 a ], and g(z) = E n >i = 

313. If l is flipped, the number of unsatisfied clauses increases by the cost of |/| and 
decreases by the number of unsatisfied clauses that contain /; and the latter is at least 1 - 

Consider the following interesting clauses, which have the unique solution 0000: 

XI V X 2 ^ Xi V X 2 ^ X 2 V X3^ X 2 V X 3 , X 3 V X 3 V X\ V X\. 

“Uphill” moves 1011 1111 and 1101 1111 are forced; also 0110 i—^ 1110 or 0111. 

314. (Solution by Bram Cohen, 2012.) Consider the 10 clauses 1234567, 1234567, 
12345, 12346, 12347, 1234, 1235, 1236, 1245, 1246, and 60 more obtained by the cyclic 
permutation (1234567). All binary x = xi ... X 7 with weight vx = 2 have cost-free flips 
leading to weight 3, but no such flips to weight 1. Since the only solution has weight 0, 
Algorithm W loops forever whenever vx > 1. (Is there a smaller example?) 

315. Any value with 0 < p < 1/2 works, since each graph component is either K\ or K 2 . 

316. No; max^(l — 0) d for 0 < ^ < 1 is d d /(d -f- l) d+1 , when 6 = l/(d 1). [But 
Theorem J for c? > 2 is a consequence of the improved Theorem L in exercise 356(c).] 

317. Number the vertices so that the neighbors of vertex 1 are 2, … ， d!, and let 

Gj = G \ {1，• • • ， ）}• Then a{G) = a{G\) — Pr(Ai fl A ‘2 fl • • • fl A m )，and the latter 
probability is < Pr(AiDA^/ +1 n- - -nA m ) = Pr(Ai | < pa(G^). 
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Let p = (d — l)/d. By induction we have a(Gj) > pa(Gj+i) for I < j < 
because vertex j + 1 has degree < d in Gj. If d f = 1 then a(G) > a{Gi) — pa(Gi) > 
pa{G\) > 0- Otherwise if d! < a(G) > a(Gi) — pa(Gdf) > a{G\) — pp l ~~ d， a{G\) > 
a{G\) — pp 1 ~ d a(Gi) = pa(Gi) > 0. Otherwise we must have d! = d+ 1， with vertex 1 
of degree d, and a(G) > a(Gi) — pp~ d a(Gi) = > 0. 

318 . Let f n = Mg ip) where G is the graph of a complete t-ary tree with i n leaves; 
thus G has t k vertices at distance k from the root, for 0 < A: < n- Then 

f 0 = 1 -p, fi = (1 -pY -p, and / n +i = fn ~ p/n-i n > 1. 

By Theorem S，it suffices to show that / n < 0 for some n. 

The key idea is to let = 1 — p and g n +i = / n +i//n = 1 — p/9n- Assuming that 
g n > 0 for all n, we have gi < g 0 and g n - g n +i = p/gh -p/gn+i > 0 when g n+1 < g n . 
Hence lim n _^oo = 入 exists，with 0 〈入 < 1. Furthermore 入 =1 — p / 入 、 so that 
p = Y(1 — A). But then p < ^/{t + l ) t+1 (see answer 316 with 沒 = 1 — 入 ）. 

[One must admit ， however，that the limit is not often reached until n is extremely 
large. For example, even if t = 2 and p = .149, we don’t have / n < 0 until n = 45. 
Thus G must have at least 2 4 ° vertices before this value of p is too large for Lemma L.] 

319 . Let x = l/(d — 1). Since e x > 1 x = d/(d — 1), we have e > (d/(d — l)) d_1 . 

320 . (a) Let fm(p) be the Mobius polynomial when pi = ••• = p m = p. Then we have 
fm{p) = /m-i(p) — pf m 一 and one can show by induction that / m (1/(4cos 2 0 ))= 
sin((m + 2)^)/((2cos ^) m+1 sin 0). The threshold decreases to 1/4 as m —oo. 

(b) 1/(4cos 2 2 ^ 7 ); the Mobius polynomial g m (p) = fm-i(p) — pfm - 3 (p) satisfies 
the same recurrence as f m (p), and equals 2 cos mO / (2 cos 0) m when p = 1/(4 cos 2 6). 

[In terms of the classical Chebyshev polynomials, g m (p) = 2p m// 2 T m (l/(2^/p)) 
and fm(p) = p (m+1)/2 U m +i (l/( 2 v^)).] 

321 . Let 6 = (2 - ^2)/2, O' = 0(1 - 0) = (^2 - 1)/2, and c = (p - 0)/(1 - 6). 
The method of answer 345 gives (Pr(ABCD)^ Pr(ABCD)^ Pr(ABCD)^ Pr(ABCD)^ 
Pv(ABCD), Pr(ABCD)) = (0, 0'(l-cf, 26» , (l-c) 2 c, 6» 2 (l-c) 2 +26» , (l-c) 3 , 6 » 2 (l-c)c+ 
30’(1 — c)c 2 , 0 2 c 2 +40 / c 3 ). Other cases are symmetric to these six. When p = 3/10 the 
six probabilities are » (0, .20092, .00408, .08815, .00092, .00002). 

322 . (a) Let aj = Wi[ij G A], bj = Ylk V^[j^ ^ c\ = 匕价 ㈣ G C], an d 山 = 
Wi[li G D]. Then when X = j and Z = l, the best way to allocate the events is 



within W and Y. Hence Pr(AnBf)CDD) = ^ 7 * i x j^^i) — l) {(bj +ci) — l)^ which 
is zero if and only if we have aj + di > 1 or + c/ > 1 for all j and l with XjZi > 0- 
(b) Since Xj(aj，bj) = (p, p ), the point (p^p) lies in the convex hull of the 
points (cij’bj). So there must be points (a ， 6) = (aj, bj ) and (a’，b’）= (ay ,bj，) such 
that the line from (a ， 6) to (a ^b f ) intersects the region {(x^y) | 0 < ?/ < p}; in other 

words + (1 — /j)a < p and #6 + (1 — /i)b f < p. Similarly we can find c, d ， c, ， d\ v. 
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(c) Fact: If a > I and b f > then /i = |; hence a = = | and a’ = b = 0. 

Notice also that there are 16 symmetries，generated by (i) a -H* 6 , c d; (ii) a a ’， 
b O b f ， H 1 — fi' (iii) c e c 、 d O d! , v ^ \ — v\ (iv) a O d, b O c, [i O v. 

If c < c 7 and d < d! ^ or \i c < | and d < ^ we can assume (by symmetry) that 
the Fact applies; this gives a solution to all the constraints，with c = d = c=d! = \. 

For the remaining solutions we may assume that a^b r > \ > a!,h. Suppose the line 
from (a ，6 ) to (a’ ，6 ’）intersects the line from ( 0 , 0 ) to ( 1，1 ) at the point (a ， a); dividing 
a,h, a!,h’ by 3a gives a solution in which fia + (1 — fi)a’ = + (1 — fi)b f = Similarly ， 

we can assume that d,c’ > ^ > d、c and that + (1 — v)c = vd + {1 — v)d! = -|- 
Consequently a + d > 1 and b r + d > 1- Symmetry also allows us to assume that 
a + C ? 7 > 1 - In particular, a > |; and, by the Fact ， b r < So a 7 > 1, d > c < |. 

Now extend the lines that connect (a ，6 ) to (a’ ，6 ’）and (c, d) to (c’ ， d’），by increas¬ 
ing a, h’ ， d, d while decreasing a!, h, c, d!, until a = 1 — d and a = 1 — d’, and until 
either a = 1 or 6 = 0, and either d = 1 or c = 0. The only solution of this kind with 
+ c 7 > 1 occurs when a = d = 1, a’ = b = c = d’ = 0,6’ = c =1/2, /i = 全 ， ^ 

(d) For the first solution，we can let W^ X, Z be uniform on {0, 1 ， 2} ， {0,1 }， 
{0,1,2}, and {0}, respectively; and let A = {10,20}, B = {11,12}, C = {00}, 
D = {00}- (For example, WXYZ = 1110 gives event B.) The second solution turns 
out to be the same, but with (X, W) in place of (W, X, Z). Notice that the 
solution applies also to where the threshold is [See STOC 43 (2011) ， 242.] 

323 . cbc. In this simple case, we just eliminate all strings in which c is followed by a. 

324 . For 1 < j < n, and for each v such that v = Xj or v —— let i -< j for each i < j 
such that v = Xi. (If several values of i qualify，it suffices to consider only the largest 
one- Several authors have used the term “dependence graph” for this partial ordering.) 
The traces equivalent to a correspond to the topological sortings with respect to - < 5 
because those arrangements of the letters are precisely the permutations that preserve 
the empilement. 

In ( 136 )，for example, with xi .. .x n = bcebafdc, we have 1 2, 1 〆4， 2 弋 4, 

4 - < 5， 3 - < 6 , 2 弋 7， 3 S 7, 2 - < 8 , 4 - < 8 , and 7 -< 8 - Algorithm 7.2.1.2V produces 105 
solutions，12345678 (bcebafdc) through 36127485 (efbcdbca). 

325 . Every such trace a yields an acyclic orientation, if we let u — >v when u appears 
at a lower level in a’s empilement- Conversely, the topological sortings of any acyclic 
orientation are all equivalent traces; so this correspondence is one-to-one. [See Ira M. 
Gessel, Discrete Mathematics 232 (2001), 119-130.] 

326 . True: x commutes with y if and only if y commutes with x. 

327. Each trace a is represented by its height h = h(a) > 0, and by h linked lists 
Lj = Lj (a) for 0 < j < h. The elements of Lj are the letters on level j of a’s 
empilement; these letters have disjoint territories, and we keep each list in alphabetic 
order so that the representation is unique. The canonical string representing a is 
then LqLi ••• Lh-i. (For example, in ( 136 ) we have Lo = be, L\ = c/, L 2 = bd, 
Ls = ac，and the canonical representation is becfbdac.) We also maintain the sets 
Uj = |J{T(a) I a G L] as bit vectors; in ( 136 )，for example，they are Uo = # 36, 
Ui = # lb ， C /2 = # 3c ， C / 3 = # 78- 

To multiply a by /?, do the following for A: = 0 ， 1 ，••” h((3) — 1 (in that order )， 
and for each letter b G Lk(/3) (in any order) : Set j ^ h(a)] then while j > 0 and 
T( 6 ) & Uj-i(a) = 0, set j ^ j — 1. If j = /i(a)，set Lj (a) empty ， Uj(a) <— 0, and 
h(a) <— h(a) + 1- Insert b into Lj (a) , and set Uj(a) ^ Uj (a) + T(b). 
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328. Do the following for k = h((3) — 1, • • •, 1 ， 0 (in that order)，and for each letter 
b G Lk{p) (in any order): Set j < - h(a) — 1; while j > 0 and T ⑹ & Uj(a) = 0, set 
j j — 1. Report failure if b isn’t in Lj (a) - Otherwise remove b from that list and set 
Uj(a) Uj(a) — T{b)\ if Uj(a) is now zero, set h(a) h(a) — 1. 

If there was no failure, the resulting a is the answer. 

329. Do the following for A; = 0, 1， • • • ， h(a) — 1 (in that order)，and for each letter 

a G Lk(a) (in any order) : Report failure if a isn’t in Lo(/3). Otherwise remove a from 
that list, set Uo(/3) Uo(/3) — T(a), and renormalize the representation of /3. 

Renorm aliz at ion involves the following steps: Set j ^ c <— 1. While Uj-i(/3) ^ 0 
and c — 0, terminate if j = h(/3)] otherwise set c 卜 0, j j+1, and then, for each letter 
b in Lj-i(/3) such that T(b) & CTj— 2 ( 用 = 0, move b from to Lj 一 2 (p) and set 

Uj- 2 (0) C/j- 2 ⑹ + T ⑻ , Uj-ii/3) Uj-i(/3)-T(b), c^l. Finally, if C / 卜 “ 用 = 0, 

set Ui-i(/3) •<— Ui(/3) and Li-i(/3) *<— Li(/3) for j <i< then set h(/3) h(/3) — 1. 

If there was no failure, the resulting p is the answer. 

330. Let the territorial universe beVUE^ the vertices plus edges of and let T(a)= 

{a} U {{a, 6 } I a —— b}. [G. X. Viennot，in 1985， called this subgraph a starfish.] 

Alternatively, we can get by with just two elements in each set T(a) if and only if 
G = L(H) is the line graph of some other multigraph H. Then each vertex a of G 
corresponds to an edge u —— v in H, and we can let T(a) = {u^ v}. 

[Notes: The smallest graph G that isn’t a line graph is the “daw” i^i ， 3 . Since 
sets of independent vertices in the line graph G are sets of disjoint edges in H, also 
called matchings of H, the Mobius polynomial of G is also known as the “matching 
polynomial^ of H. Such polynomials are important in theoretical chemistry and physics. 
When all territories have |T(a)| < 2, all roots of the polynomial Mq{z) in ( 149 ) are 
real and positive，by exercise 341 • But M c i aw (z^ z) = 1 — 42 ： + 3z 2 — z 3 has complex 
roots ^ 0.317672 and 1.34116 士 1.16154i] 

331. If a is a string with k > 0 occurrences of the substring ac, there are 2 k ways 
to decompose a into factors {a ， 6 ， c ， ac}, and the expansion includes +a and —a each 
exactly 2 k ^ 1 times. Thus we’re left with the sum of all strings that don’t contain L ac\ 

332. No: If b commutes with a and c，but ac ^ ca ， we’re dealing with strings that 
contain no adjacent pairs ba or c 6 ; hence cab qualifies, but it’s equivalent to the smaller 
string bca. [Certain graphs do define traces with the stated property, as we’ve seen in 
( 135 ) and ( 136 ). Using the next exercise we can conclude that the property holds if 
and only if no three letters a < b < c have a —j—h,h —j—c, and a —— c in the graph G 
of clashes- Thus the letters can be arranged into a suitable linear order if and only if 
G is a cocomparability graph; see Section 7-4.2.] 

333. To show that ^2 aeA l)' ,/ 3 'a/5 = 1， let 7 = ai • • • a n be any nonempty string. 

If 7 cannot be factored so that ai .. .ak G A and a^i •. • a n G then 7 doesn’t appear. 
Otherwise 7 has exactly two such factorizations, one in which k has its smallest possible 
value and the other in which k is one greater; these factorizations cancel each other in 
the sum. [Manuscripta Math. 19 (1976) ， 239-241.] 

334. Equivalently we want to generate all strings of length n on the alphabet {1, •…, m) 
that satisfy the following criterion, which strengthens the adjacent-letter test of exer¬ 
cise 332: If 1 < i < j < Xi —j—Xj, x^i —j— Xj, •…， Xj^i —j— Xj, then < Xj. [See 
A. V. Anisimov and D. E. Knuth ， Ini. J. Comput. Inf. Sci. 8 (1979) ， 255-260.] 

Tl. [Initialize-] Set xo ^ 0 and x/c 1 for 1 < A: < n. 

T2. [Visit-] Visit the trace xi ... x n . 
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T3. [Find k.] Set k ^ n. While Xk = m set k k — 1. Terminate if A: = 0. 

T4. [Advance Xk] Set Xk^Xk + ^ and j ^ k — 1. 

T5. [Is Xk valid?] If Xj > Xk and Xj —f—xu, return to T4. If j > 0 and Xj < Xk 
and Xj —/—Xk, set j i- j — 1 and repeat this step. 

T6. [Reset Xk+i - . - x n .] While k < n do the following: Set A: ^ A: + 1, x/c ^ 1; 
while Xk-i > Xk and Xk-i —f—Xk, set Xk ^ Xk + 1- Then go back to T2. | 

335. Given such an ordering, we have Mg = det (/ — A )， where the entry in row u and 
column v of A is v[u > v or u — v]. The determinant in the given example is 


det 


/I 

-b 

—c 

0 

0 

0 、 


(- a 

-b 

—c 

0 

0 

0 \ 

0 

1-b 

0 

-d 

0 

0 1 


0 

1 

c 

-d 

0 

0 

0 

~b 

1 —c 

~d 

—e 

0 

+ det 

0 

0 

1 

-d 

—e 

0 

0 

—b 

—c 

1-d 

0 

—f 

0 

0 

0 

1-d 

0 

-f 

0 

-b 

—c 

-d 

1 —e 

-f i 


0 

0 

0 

-d 

1 —e 

-f 

Vo 

~b 

—c 

~d 

—e 

1 —// 

\ 0 

0 

0 

~d 

—e 

1-// 


after expanding the first column, then subtracting the first row from all other rows in 
the right-hand determinant- Therefore this rule satisfies recurrence ( 142 ). 

[The result also follows from MacMahon’s Master Theorem，exercise 5.1.2—20, 
using the characterization of lexicographically smallest traces in answer 334. Accord¬ 
ing to Theorem 5.1-2B, such traces are in one-to-one correspondence with multiset 
permutations whose two-line representation does not contain v u when v > u and v—j—u. 
Is there a similar determinantal expression when G is not a cocomparability graph?] 

336. (a) If a is a trace for G and /3 is a trace for H, we have (a/3) = 阳 (/?)• 

Hence Mg^h = MqMh- (b) In this case \ig ~^(a/5) = \ig [ol) if = e ，[in (/5) if a = e; 
otherwise it’s zero- Therefore Mg~h = Mg + Mh — 1- 

[These rules determine Mq recursively whenever G is a cograph (see exercise 7- 
90)- In particular, complete bipartite and A:-partite graphs have simple Mobius series ， 
exemplified by Mq = (1 — a)(l — 6)(1 — c) + (l —d)(l — e) + (l —/) — 2 when G = Ks, 2 ,i-] 

337. Substituting ai H - h for a in Mg gives M G ，• (Each trace for G r is obtained 

by putting subscripts on the a’s of the traces for G.) 

338. The proof of Theorem F needs only minor changes: We limit a to traces that 
contain no elements of and we define a and 〆 by letting a be the smallest letter ^ A 
in the bottom level of 7 ’s empilement. If 7 has no such letter, there’s only one factor¬ 
ization, with a = e. Otherwise we pair up cancelling factorizations- [Incidentally, the 
sum of all traces whose sinks are in A must be written in the other order: ^g 1 ^G\A-] 

339. (a) “Push down” on piece Xj and factor out what comes through the floor. 

(b) Factor out the pyramid for the smallest label, and repeat on what’s left. 

(c) This is a general convolution principle for labeled objects [see E. A. Bender 

and J. R. Goldman, Indiana Univ. Math. J. 20 (1971), 753-765]- For example, when 
l = 3 the number of ways to get a labeled trace of length n from three labeled pyramids 
is (ij,k) PiP j Pk /^- = n! with i+j + k = n in both 

of these sums. We divide by 3! so that the topmost pyramid labels will be increasing. 

(d) Sum the identity in (c) for Z = 0 ， 1 ， 2, •… 

(e) T(z) = ^> 0 ^^ = 1 / m g(z) by Theorem F, and P(z) = Yl n >iPriZ n /n. 
Note: If we retain the letter names, writing for example Mq(z) = 1 — (a + 6 + c ) 2 ： + acz 2 
instead of Mg(z) = l-3z-hz 2 ^ the formal power series — In Mg(z) exhibits the pyramids 
of length n in the coefficient of z n ^ but only in the sense of commutative algebra (not 
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trace algebra). For example, the coefficient of obtained from — Mc(z)) k /k 

with trace algebra includes the nonpy rami dal term bac/6. 

340- Let w((ii ... i k )) = (-l) k ^ 1 ai 1 i 2 ai 2 i 3 .. thus w(tt) = (-ai 3 a 3 4a42a2i) 

(―« 57 奶 5 ) (a 66 ) in the given example. The permutation polynomial is then det by 
definition of the determinant- (And we get the permanent，if we omit the (—l)^ -1 -) 

341. The hint is true when n = 2, since the first involution polynomials are wux and 

W 11 W 22 X 2 —wi 2 . And there^s a recurrence: W(S) = wuxW(S\i) — j})- 

So we can prove the existence of n + 1 roots 5 i < n < • • • < r n < s n +i by induc¬ 
tion: Let W n (x) be the polynomial for {!• ， ••. ， n}. Then W n -\-i (x) is W( n ^i)( n ^i)xW n (x) 

minus n polynomials w;( n+1 )j VF({ 1 ， •… ， n} \ j)，each with roots that are nicely 
sandwiched between the roots of W n . Furthermore q^l k = —qf and r n +i—& = —r/c, 
for 1 < A: < n/2. It follows that Wn+i(r n ) < 0, Vl^ n +i(r n —i) > 0， and so on, with 
(—l) k W n -]-i(rn-\-i-k) > 0 for 1 < A: < n/2. Moreover ， Wn+i(0) = 0 when n is even; 
(—l) /c W n +i(0) > 0 when n = 2k — 1] and W n ^-i(x) > 0 for all large x. Hence the 
desired Sk exist. [See Heilmann and Lieb ，Physical Review Letters 24 (1970), 1412.] 

342. If we replace (ii • • • ik) by ai 1 i 2 ai 2 i 3 ... ai k i 1 (as in answer 340， but without the 
(—l)^ -1 ), then Mc n becomes det(/ —A). Replacing aij by aijXj gives the determinant 
in MacMahon^s Master Theorem. And if x\ = • • • = x n = we get the polynomial 
det(I-xA)^ whose roots are the reciprocals of the roots of A^s characteristic polynomial. 

343. The formulas in answer 336 show that Mg?( pi ， … ， Pm) increases whenever any 

Pj decreases, with respect to a cograph G. The only graph on < 4 vertices that isn’t 
a cograph is Pa (see exercise 7-90); then Mo(pi^P2^P3^P4) = 1 — pi — p 2 — P 3 — P 4 + 
P 1 P 3 + PiP4 + p‘2P4 = (1 — pi)(1 — 仍一 Pa) — p‘ 2 (l — Pa) - In this case also we can 
conclude that Mg (pi ， … ， P 4 ) > 0 implies (pi ， … ， P 4 ) G 1Z{G). But when G = we 
find Mg (1 — e ，1 — e ， e ，1 — e ，1 — e) >0 for 0 < e < 0 -2 ; yet (1 — e ，1 — e ， e ，1 — e ，1 — e) 
is never in 1Z(G) because 0, e，1 — e，1 — e) = —(1 — e) 2 - 

344. (a) If some minterm, say B 1 B 2 JB 3 B 4 , has negative “probability,” then pip4 x 

(1 — 7 T 2 — 7 T 3 + tt‘ 23 ) < 0; hence 0) < 0 violates the definition of 1Z(G). 

(b) In fact，more is true: ttiuj = 7 T /ttj when i j for i G j G J, and IDJ = 0. 

(c) It’s Mc(pi[l G J],. •. ^p m [m G J]), by ( 140 ) and ( 141 )- This important fact, 
already implicit in the solution to part (a)，implies that /3{G \ J) >0 for all J. 

(d) Writing just ‘J’ for { G\J\ we shall prove that a(i U U J) > a(J)//3(J) 

for i 味 J, hy induction on \J\. Let J f = {j e J \ i-f— j}. Then we have 


a(i U J) = a(J) - Pr( Ai D 门不 ) >a(J)-Pr^A, n 门不 ) > a(J) -pia(f), 

jeJ jer 

because of ( 133 ). Also /3(iU J) = /3(J) —pi/3(J , ). Hence a(i U J) /3(J) — a(J)/3(i U J) > 
(«(J) - pia(f))/3(J) - a(J) (/3(J) — pi/3(f)) = pi(a(J)/3(f) - a(f)/3(J)), which is 
> 0 by induction since J f C J. 

[This argument proves that Lemma L holds whenever Q?i ， … ,p m ) leads to a le¬ 
gitimate probability distribution with /3(G) > 0; hence such probabilities are in 7 ^(^)-] 
(e) By induction, we have /3(i U J) = /3(J) — _^(1 ~ 0j) > /3(J) — 

Oi0(J f ) n^jxj/C 1 - °j) > (! - 咖 (J), because G _ 心 ). 


345. (Solution by A. D. Scott and A. D. SokaL) Set p f j = (1 + S)pj where ^ < 0 is 
the slack of (pi ， … ， Pm). Then Mg (pi, … ， P’m) = 0, but it becomes positive if any 


p f j is decreased. Define events B’ u …， B f m by the construction in exercise 344. Let 
(7i, … ， C m be independent binary random variables such that Pr((7j = 1) = qj, 
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where (1 - Pj){l - qj) 


1 - 


Then the events Bj = Bj V Cj satisfy the desired 


conditions: Pr(Bi | Bj 1 fl • • • fl Bj k ) = Pr(Bi \ Bj 1 fl • • • fl Bj k ) = Pv(Bi) = pi ； and 
Pr(J5i V … V B m ) > Pr(B[ V … V = 1. 

346. (a) By ( 144 ) ， K a ,G is the sum of all traces on the probabilities of G \ a whose 
sources are neighbors of a. Decreasing pj doesn’t decrease any trace- 

(b) Suppose vertex a = 1 has neighbors 2 ， … ， j. If we’ve recursively computed 
M G \ a * and M G \ a ，finding that ( 巧 +i，• • • ,p m ) e 7Z(G\a^) and (p 2 , • • ^Pm) G 7Z(G\a)^ 
then we know K a ,G] and the monotonicity property in (a) implies that (pi, … ^ 
1^{G) if and only if aK a ^o < 1 - 

a o ~o b 

The graph G = c i — id in exercise 335 can, for example, be processed as follows: 

e^-of 


-^-abcdef 


bcdef 


1—a 


Mbcdef = M c def (1—6 


Mcdef = Mdef ( 1 —C 


Mdef 
Mbcdef 
M ce f 
M c d e f 

Mf 


M ce f = M e f (1 —c 


Mdef = M e f ( 1-d 


Mdef 
Mf 

w f 

M e 


=(1 —a’)(l — 6’） … （1 — /’) ， a 1 
=( 1 — 6 ’)(l_c ’) … (1 -/’)， b 1 

(1 —c， )(l ~^) (1 —e， )(l — /)? c， 


a 


ef 


M e f = Mf ll — e 


e 


MAl-e 


M f =M e (l-f 


M e 

Mf 

M e 

Me 

M e 

Me 


(1 —/), 


c 


n 


bil-c") 

c 

c 


e 


r 

e 


// 


(l-〆)’ 

d(l-e") 

(l-eO(l-f) 

e 

(W 7 ) 5 


e 


/ = /， 


with M e = 1. (The equations on the left are derived top-down, then the equations on 
the right are evaluated bottom-up. We have (a,b, … ， f) G 1Z(G) if and only if /’ < 1 ， 
e" < 1, e 7 < 1, … ， a 7 < 1.) Even better is to traverse this graph in another order, 
using the rule Mq^h = MqMh (exercise 336) when subgraphs aren’t connected: 


■^■cdabef = -^-dabef ( 1 — C 


M f 


-^dabef = ^ab ^ef ( 1 — d 


^-dabef 

a 


(1 一 (^)(1 一 (/) • • • (1 一 /)， C 


c 




MabM e f 


{l—d r ){l — a){l — b r ){l — e){l — f)^ (see below) 


M ah = M b 11-a 


a 


MAl-a 


M b = M e 1-6 


M e 

M~ b 

Me 

Me 

M e 

Me 


(1—a’ ） (l-6’) 


( 1-0 

( 1 -吖 


a 


a 


( 1 - 6 ，) 


// 

a = a 


y = b 


where d f = dM a M e /(M ab M ef ) = d(l _e") 八 (1 _a’)(l _6 7 )(1 — ^)(1 _/’))，and 

M eh M e , M h M e are as before- In this way we can often solve the problem in linear 
time. [See A. D. Scott and A. D. Sokal ， J. Stat. Phys. 118 (2005) ， 1151—1261 ， §3.4.] 
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347. (a) Suppose vi —— m - Vk —— vi is an induced cycle. We can assume that 

vi y V 2 . Then，by induction on j，we must have vi >Vj for 1 < j < A:; for if 
y Vj we would have Vj+i —— Vj-i by (*)• But now Vk —— vi implies that k = 3. 

(b) Let the vertices be {1, •… ，爪 }, with territory sets T(a) C [/ for 1 < a < m; 
and let f/ be a tree such that each U | T\a) is connected- Let U a be the least common 
ancestor of T(a) in [A (Thus the nodes of T(a) appear at the top of the subtree rooted 
at U a -) Since U a G T(a), we have a —— b when U a = 

Writing 5 yu t for the ancestor relation in [/, we now define a y b if U a Ub 
or if U a = Ub and a <b. Then (*) is satisfied: If 尤 G T(a) fl T( 6 )，we have U a >-u t and 
Ub >-u ^ hence U a hu Ub or Ub >：u U a , hence a y b or b y a. And if a y b y c and 
t G T"(a) fl T(c)，we have U a bu Ub >：u U c ] consequently Ub G T"(a) fl T( 6 ), because Ub 
lies on the unique path between t and U a in U and T(a) is connected. 

(c) Processing the nodes in any order such that a is eliminated before b whenever 
U a is a proper ancestor of Ub will then lead only to subproblems in which the algorithm 
needs no “double-primed” variables. 

For example, using (a,b, ••” g) instead of (1 ， 2 ， ..”7) in order to match the 
notation in exercise 346， suppose U is the tree rooted at p having the edges p —— q, 
P — r, r —— s, r —— t, and let T(a) = {p,q,r,t}, T(b) = {p,r,s}, T(c) = {p,q}, 
T\d) = {q}^ T(e) = {r ， s} ， T(f) = {s} ， T\g) = {t}. Then aybycyd^cyeyf^ 

^ ^ 9 • The algorithm computes ^^abcdefg = (1 — a ) bcdefg 5 ^^bcdefg ~ (1 — b ) J\d C( i e j^g 5 

etc., where a = aMf/M bcdefg , b' = bM dfg /M cde f g = b(M d M f M g )/(M cd M e fM g ), etc. 

In general, the tree ordering guarantees that no “double-primed” variables are 
needed. Thus the formulas reduce to v 7 — vj Yl u _ v — u’) for each vertex v. 

(d) For example, we have pi = a ， … ， P 7 = g, Oi = a’，•. •, O 7 — g in (c). The 
values of the G’s, which depend on the ordering are uniquely defined by the given 
equations; and we have Mg (pi, … ,Pm) = (1 — … （1 — 6 m ) in any case. [W. Pegden, 
Random Structures & Algorithms 41 (2012) ， 546-556-] 

348• There is at least one singularity at z = pe for some 6 . If 0 < r < p, the power 
series f(z)= /( n ) {re L 0 ){z—re l0 ) n / n\ has radius of convergence p—r. \iz — p isn’t 

a singularity, the radius of convergence for 0 = 0 would exceed p — r. But |/( n )(re^)| = 
m% n (re ⑺) m_n I $ f ⑻ (r). [Mathematische Annalen 44 (1894) ， 41-42.] 

349 - Typical generating functions are 分 0000001 = 1 ; 如 110110 = 之 ( 分 0100110 + ^oiomo + 
分 0110110 + 分 oiiiiio)/4 (in Case 1) or ^0110110 = ^(^oooono+^ 0010110 +^ 0100110 +^ 0110110 )/4 
(in Case 2). These systems of 128 linear equations have solutions whose denominators 
involve one or more of the polynomials 4 — 2 ：, 2—z^ 16—12z^z 2 ^ 4 一 3 之 , 64-80z-\-24z 2 —z 3 ^ 
8 — 8 z + z 2 in Case 1 (see exercise 320); the denominators in Case 2 are powers of 4 — z. 

Setting g{z) = Yl x 9x(z)/128 leads to g{z) = 1/((2 — 2:)(8 — 82 ： + z 2 )) in Case 1 ， 
with mean 7 and variance 42; g{z) = (1088 — 4002 ： + 422： 2 — z s )/(4 — z ) 6 in Case 2, with 
mean 1139/729 ^ 1.56 and variance 1139726/729 2 2.14. 

[The upper bound E\ H - [-Eq is achieved by the distribution of Case 1， because 

it matches the extreme distribution ( 148 ) of the path graph Pq. Incidentally，if Case 1 
is generalized from n = 7 to arbitrary n，the mean is n(n — 1)/6 and the variance is 
(n + 3)(n + 2)n(n — 1)/90.] 

350. (a) The generating function for N is Yl^ =1 (^ — C/c)/(l — ^kz)\ so the mean and 
variance, in general，are Yjk=i 67(1 — 60 an d 67(1 — ^/c) 2 - In particular, the 

means are (i) n; (ii) n/(2n — 1); (iii) n/ (2 n — 1); (iv) H 2 n — H n + 士 = In 2 + 0(l/n); 

(v) + 士 — 2 ^+ 1 ) = ^ 0(l/n 2 ). The variance in case (i) is 2; otherwise it’s 

asymptotically the same as the mean, times 1 + 0 (l/n). 
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(b) In this case the mean and variance are《/(l —《）and ^/(l —《) 2 ， where 

C = Pr(A m ) = 1 - (1 -⑴… （1 - Q This value C is (i) 1 — 2~ n ; (ii) 1 - (1 - 忐 ) n = 
1 - e- 1/2 + 0(l/n); (iii) 1 - (1 - 2- n ) n = n/2 n + 0(n 2 /4 n ); (iv) 1/2; (v) l/(2n + 2). 
Hence the respective means are (i) 2 n — 1; (ii) e 1 ’ 2 — l + 0(l/n); (iii) n/2 n +0(n 2 /4 n ); 
(iv) 1; (v) l/(2n + 1). And the variances are (i) 4 n — 2 n ; (ii) e — e 1 / 2 + 0(l/n); 
(iii) n/2 n + 0(n 2 /4 n ); (iv) 2; (v) l/(2n + 1) + l/(2n + l) 2 . 

(c) Since G is exercises 336 and 343 imply that (Ci, …，“, €) G 7^((7) if 

and only if ^ This condition holds in cases (ii), (iii), and (v). 

351. (Solution by Moser and Tardos.) We require i —— j if there’s a setting of the 
variables such that Ai is false and Aj is true, provided that some change to the variables 
of Ej might make Ai true. And vice versa with i ^ j. 

(The Local Lemma can be proved also for directed lopsidependency graphs; see 
Noga Alon and Joel H. Spencer, The Probabilistic Method (2008), §5 丄 But the theory 
of traces, which we use to analyze Algorithm M, is based on undirected graphs, and no 
algorithmic extension to the directed case is presently known.) 

352. Answer 344(e)，with Mg = /3(iU J), M G \i = 卢 (J)，proves that Mq^/Mg > 1 —ft- 

353. (a) There are n + 1 sorted strings in Case 1， namely Q k l n ~ k for 0 < k < n. There 
are F n +2 solutions in Case 2 (see，for example, exercise 7.2.1.1—91). 

(b) At least 2 n Mcf(l/4)，where G is the path P n -i- By exercise 320 we have 
Mg( 1/4) = / n —i(l/4) = (n+ l)/2 n ; so Case 1 matches the lower bound. 

(c) There are no lopsidependencies. Hence the relevant G is the empty graph on 
m = n—1 vertices; Mg (1/4) = (3/4 ) n_1 by exercise 336; and indeed, F n +2 > 3 n_1 2 2_n . 

354. Differentiate ( 151 ) and set z <— 1. 

355. If A = Aj is an isolated vertex of then 1 — pjz is a, factor of the polynomial 

Mg(z) in ( 149 )，hence 1 + ^ < 1/pj ； and Ej = Pj/(1 — pj) < 1/S. Otherwise 

Mg(pu... ,Pi-1,Pi(l + ， Pm ) = Ma(l)- S Pj M^ A , (1)'> MS(1 +0 = 0; 

so 馬 • = 朽呢 \^( 1 )/啦 ⑴ > 1 /& 

356. (a) We prove the hint by induction on |外 It’s obvious when 5" = 0; otherwise 
let X = S n [J i£Uj Uj and y = 5 \ X. We have 

Pr(A. I 石 ) = Pr (^nAxnl ： y) < Pr(^nAy) 〈 Pr(AQPr(Ay) = Pr ⑷ 

Pr(Ax A Ay) Pr(Ax A Ay) Pr(Ax n Ay) Pr(Ax \ Ay) 

by ( 133 ) - Suppose i belongs to the cliques Uj Q , …， Uj r where j = jo. Let X 0 = 0 
and Xk = (S D Uj k ) \ Yk = Y U Xi U • • • U for 1 < A: < r. We have 

Pr(Ai I Ay k ) < 0i jk for all l e X k , since 叫 < |5| when X k ^ 0; hence Pr(A Xk \ Ay k ) > 
(1 + 0 ijk - U jk ). Thus Pt(Ax\A y ) = Pr(A Xl \A Yi ) Pt(Ax 2 \Ay 2 ) ... Pv(Ax r \A Yr ) > 
Y\k^j,ieu h 0~ ^ ^ ik ~ X fc )，by the chain rule (exercise MPR-14); the hint follows. 

Finally let Wk = t/i U • • • U for 1 < k < t. The hint implies that 

Pr(-4i D ... fl A m ) = Pr(A^) Pr(Aw 2 \ ) ... Pr(Aw t \ Aw t ^ x ) 

> (i_r 1 )(i_r 2 )...(i_r i ) > 0 . 

(b) The extreme events Bi, … ， B m of Theorem S satisfy the hint of (a)- Thus 

I Bk) S 

Furthermore E{ = ^/(l — in ( 152 )，because qi = piM G \^ /M G \^ 


/{l+Oij-Ej). 


Pr(Bi I B k) < Oij for all i G Uj] hence qi = Pr(Bi 
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(c) Let t/i ，... ^ Ut be the edges of with Oik = Oi when Uk = {i, j}. Then 
Uk = Oi+Oj < 1 5 and the sufficient condition in (a) is that Pr(A^) < 6i Ylj^k.i~j (1 — 心） 
whenever i —— k. (But notice that Theorem M does not hold for such larger pi.) 

[K. Kolipaka, M. Szegedy, and Y. Xu, LNCS 7408 ( 2012 ), 603 - 614 .] 

357. If r > 0, we have x = r/(l—p), y = r/ (1—^). But r = 0 is possible only on the axes 
of Fig. 51: Either (p^q) = (0,1)，x = 0, 0 < y < 1， or (p ， g) = (1 ， 0)，0 < x < 1， y = 1. 

358. Suppose x > y (hence p > q and x > 0). Then p < r if and only if 1 — ?/ < ?/• 

359. Instead of computing tti by formula ( 154 )，represent it as two numbers ( 丌广，丌， ’）， 
where 7 r ; + is the product of the nonzero factors and tt[ is the number of zero factors. 
Then the quantity 717 needed in ( 156 ) is 7 r^[ 7 rJ- = 0]; and the quantity 7 T //(1 — rjc-^i) is 

7 T 广 {k[ = 1 ] if rjc-^i = 1 , otherwise it’s 7 r 广 {k[ = 0 ]/(l — rjc^i)- A similar method can be 
used to separate out the zero factors of Yli ec in ( 157 ). 

360. We may assume that 773 = 0. Since 7T/ = 1 implies that r/c-^i = = 0， we 

have T] C -^i = Vc -^2 = Vc-^3 = Vc-^4 = 7l->c = 72 - = 7“c = 744 c = 0 for 
all C. Consequently, as in ( 159 ), all but three of the values rjc-^i are zero; let z 
denote the others. Also let r/i = a, rj 2 = b, r] 4 = c, u = d. Then 7Ti = (1—a)(l —$)， 
7 T 2 = (1—6)(1— 7 T 4 = (1—c)( 1 — 2 ：), and = 1 — d. A fixed point is obtained if 
x = d(b + cd(l—b) + ad 2 (l—b)(l—c))/(1 — d 3 (l—a)(l—6)(1—c)), etc. If cf is 0 or 1 then 
x = y = z = d. [Are there any other fixed points，say with 7 Ti ^ 1?] 

361. The 7 r’s and 7 ’s will also be either 0 or 1, and we exclude the case 丌 / = 717 = 0; thus 
each variable v is either 1 ， 0 , or *, depending on whether ( 〜，丌 。） is ( 0 , 1 ),( 1 ， 0 )，or ( 1 ， 1 ). 

Any assignment of 1 ， 0, or * to the variables is permissible, provided that every 
clause has at least one literal that’s true or two that are *. (Such partial assignments 
are called “covering,” and they’re usually possible even with unsatisfiable clauses; see 
exercise 364 .) All survey messages r( C -^i = Vc-^i are zero except when clause C has l 
as its only non - false literal- The reinforcement message 77 / can be either 0 or 1， except 
that it must be 1 if / is true (717 = 0 ) and all messages r]c-^i are 0 . 

If we also want r/5 = r] h we take n = 1 in ( 158 ), and rji = 1 — 7 T/. 


362. Create a linked list I/，containing all literals that are to be forced true, including 
all literals that are in 1-clauses of the original problem. Do the following steps while L 
is nonempty: Remove a literal l from L; remove all clauses that contain Z; and remove 
l from all the clauses that remain. If any of those clauses has thereby been reduced 
to a single literal, (/’），check to see if l f or V is already present in L. If V is present, 
a contradiction has arisen; we must either terminate unsuccessfully or restart step S8 
with increased i/j. But if V and l f are both absent, put l f into L. 

363• (a) True; indeed, this is an important invariant property of Algorithm C. 

(b) W (001) = 1, W (***) = pip 2 P 3 ^ otherwise W(x) = 0. 

(c) Statements (i) and (iii) are true, but not (ii); consider x = 10*, x = 00*，and 



the clause 123. 

(d) All eight subsets of {1, 2 , 3 } are stable except { 2 , 3 }, because x\ is 
constrained in 100- The other seven are partially ordered as shown. (This 
diagram illustrates L 7 , the smallest lattice that is lower semimodular but not modular.) 

(e) X 2 XS = 00 01 0* 10 11 1* *0 *1 ** 

xi =0 0 qiq 2 0 qiq3 qiq‘ 2 qs qiq2P3 0 qiP2q3 qiP2P3 

xi = 1 q2q3 qiq‘2q3 qiq2P3 qiq^qz qiq2q3 qiq2P3 qip^qz qiP2q3 qiP2P3 

xi = * 0 piq2q3 piq2P3 Piq2q3 piq^qz pi ^3 PiP2q3 piP2q3 P 1 P 2 P 3 
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(f) One solution is {12345, 14, 25, 345, 345}. (For these clauses the partial assign¬ 
ment {3} is stable, but it is “unreachable” below {1 ， 2, 3, 4, 5}.) 

(g) If L = L’ \ l and L 7 G £ but L ^ introduce the clause (xi V \/ keL f Xk)- 

(h) True, because L f = L\l f and L" = L \ 广 ’， where |/ 7 | and \l n \ are uncon¬ 
strained with respect to L. A variable that’s unconstrained with respect to L is also 
unconstrained with respect to any subset of L. 

(i) Suppose L' = L ,(0) ■< —— < L’ ⑷ =and L" = L’’ ⑼ < - < = 

{1,..., n}. Then L’( s_z ) D L’’ ( 卜 J ) is stable for 0 < « < 5 and 0 < j < by induction 
on i + j using (h). 

(j) It suffices to consider the case L = {1 ， … ， n}. Suppose the unconstrained 
variables are xi^ X 2 ^ x^>. Then, by induction, the sum is qiq 2 q 3 + pi + P 2 + P 3 — 
(P 1 P 2 + piP 3 + P 2 P 3 ) +P 1 P 2 P 3 = 1， using “inclusion and exclusion” to compensate for 
terms that are counted more than once. A similar argument works with any number 
of unconstrained variables. 

Notes: See F. Ardila and E. Maneva, Discrete Mathematics 309 (2009) ， 3083- 
3091- The sum in (j) is < 1 when each pk+qk < 1 for 1 < A: < n, because it is monotone. 
Because of (i), the stable sets below L form a lower semimodular lattice，with 

L' A L" = L’ Cl L" and L' V L "= 门 {L"’ | I/" D L f U L" and L'" C L}. 

E. Maneva and A. Sinclair noted in Theoretical Comp. Sci. 407 (2008) ， 359-369 that a 
random satisfiability problem is satisfiable with probability < W{X)^ the expected 

total weight of partial assignments having the given distribution, because of identity (j); 
this led them to sharper bounds than had previously been known. 

364. (a) True if and only if all clauses have length 2 or more. 

(b) 001 and *** are covering; these are the partial assignments of nonzero weight, 
when ^1 = • • • = g n = 0 in the previous exercise. Only 001 is a core. 

(c) *** is the only covering and the only core; VF(0101) = W (0111) = q^. 

(d) In fact，every stable partial assignment L r has a unique covering assignment L 
with L [ L’，namely L = f]{L n | L" □ L’，obtained by successively removing 
unconstrained literals (in any order)}. 

(e) If L' and L" are adjacent we have L' f) L" C L' and L' D L" C L". 

(f) Not necessarily. For example, the clauses {1234, 1234, 1234, 1234, 1234, 1234} 
define 5"2($1, 尤2 , 尤3 , 為 ）； there are two clusters but only an empty core. 

[A. Braunstein and R- Zecchina introduced the notion of covering assignments in 

J. Statistical Mechanics (June 2004), P06007:l-18.] 

365. If L is any of the six solutions in (8)，and if q is odd，then qL—d is a covering assign¬ 
ment for 0 < d < ^ and 8q—d <n< 9q—d. (For example, if L = {H 3,4, 5, 6, 7, 8} the 
partial assignment 3L — 1 = {H 8,11 ， II ， 17, 20, 23} works for n G [23 • • 25].) Thus all 
n > 63 are “covered •” [Do all nonempty coverings of waerden (3, 3; n) have this form?] 

366. Eliminating variable 1 (xi) by resolution yields the erp rule x\ (x 2 V X 3 ) A 
(xs V X 4 )，and new clauses {234,234,234, 234}. Then eliminating 2 (x‘ 2 ) yields X 2 - 
(x 3 V xa) A (X 3 V xa) and new clauses {34, 34}. Now 4 (^ 4 ) is pure; so X 4 1， and 
F 7 = 0 is satisfiable- (Going backwards in the erp rules will then make *<— 1, X 2 <— 1, 
xi 0, regardless of x^.) 

367. (We can choose whichever of the two assignments is most convenient，for example 
by picking the shortest，since either one is a valid erp rule.) Any solution will either 
satisfy all the clauses on the right side of x or all the clauses on the right side of x, or 
both. For if a solution falsifies both Ci\x and Cj \ it falsifies C { o Cj. 
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In either case the value of x will satisfy all of the clauses C^， ••” C a , C、, • • • ， C f h . 

368. If ⑴ is a clause，subsumption removes all other clauses that contain 1. Then 
resolution (with p = 1) will remove l from all q of its clauses, and (Z) itself- 

369. Let Ci = (l VaJ and Cj = (IW(3j). Each omitted clause Ci<>Cj = (a^V (3j)^ where 

1 < i < p and r < j < is redundant, because it is a consequence of the non-omitted 
clauses (叫 V 『 1 )， • • • ， {ai V『 r )，（Zi V • • • V V 你 ） via hyperresolution. [This technique 
is called “substitution，” because we essentially replace |/| by its definition.] 

370. (a V 6) A (a V 6 V c) A (a V 6 V c) = (a V c) A (6 V c). (See the discussion following 
7.1.1—( 27 ) • In general, advanced preprocessors use the theory of DNF minimization, in 
its dual form, to find irredundant minimum forms for CNF. Such techniques are not 
implemented, however, in the examples of preprocessing considered in this section.) 

371. One scenario starts by eliminating variable 1, replacing eight clauses by eight new 

ones: 2347, 2347, 2359, 2359, 3457, 3457, 4579, 4579. Then 8 is eliminated，replacing 
another eight by eight: 2456, 2456, 2567, 2567, 2579, 2579, 4679, 4679. Then come 
self-subsumptions: 2347 237 (via 234), 3457 357 (345), 357 4 35 (357); and 

35 subsumes 345, 357. Further self-subsumptions yield 2359 ^ 239， 2359 239, 

2579 279, 2456 »-> 246, 246 4 46; and 46 subsumes 456, 4679, 246. Similarly, 

2567 267, 4579 ^ 459, 2347 237, 3457 357, 357 ^ 35; and 35 subsumes 345, 

357. Then 2456 246, 246 46; and 46 subsumes 456, 246, 4679. Also 2567 267, 

4579 459, 2579 279. 

Round 2 of variable elimination first gets rid of 4, replacing six clauses by just 
four using exercise 369: 236, 236, 569 ， 569. Then variable 3 goes away; ten clauses 
become eight, again via exercise 369: 256, 256, 257, 257, 259, 259, 569, 569. And the 
ten clauses that now contain 2 or 2 resolve into just four: 5679, 5679, 5679, 5679. 

After eliminating 7 and 9, only four clauses remain, namely 56 ， 56 ， 56 ， 56; and 
they quickly produce a contradiction. 

372. (This problem is surprisingly difficult.) Are the clauses {15, 16, 25, 26, 37, 38, 
47, 48, 123, 124, 134, 234, 567, 568, 578, 678} as “small” as possible? 

373. Using the notation of ( 102 ), elimination of X 2 m^ •…, x mm produces new 
clauses M[ mk for 1 < i, A: < m as well as M m ( m — 工 ）. Then elimination of 

gives VM m ( m _ 2 )) for 1 < i < m. This clause self-subsumes to using 

MimU … ， M im(jn _ 2 y And subsumes each M’ imk , so we’ve reduced m to m — 1. 

374. As in ( 57 )，variables are numbered 1 to n, and literals from 2 to 2n + 1- But we 
will now number the clauses from 2n + 2 to m + 2n + 1- The literals of clauses will be 
stored in cells, somewhat as in Algorithm A, but with additional links as in the exact 
cover algorithms of Section 7.2.2.1: Each cell p contains not only a literal L(p) ， a clause 
number C(p), and forward/backward pointers F(p) and B(p) to other cells with the 
same literal, but also left / right pointers S(p) and Dip) to other cells in the same clause. 
(Think “sinister” and “dexter.”）Cells 0 and 1 are reserved for special use; cell Z，for 

2 < / < 2 n + 2 , serves as the head of the doubly linked list of cells that contain the 
literal /; cell c，for 2 n + 2 < c < m + 2 n + 2 , serves as the head of the doubly linked list 
of cells that contain the elements of clause c; and cell p, for m + 2 n + 2 < p < M, either 
is available for future use or holds literal and clause data for a currently active clause. 

Free cells are accessed via a global pointer AVAIL. To get a new p 4= AVAIL when 
AVAIL ^ 0, we set p AVAIL，AVAIL l S(AVAIL); but if AVAIL = 0, we set p ^ M and 
M ^ M + 1 (assuming that M never gets too large). To free one or more cells from p 
to p" that are linked together via left links，we set S(p 7 ) ^ AVAIL and AVAIL ^ p n . 
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The number of active clauses containing literal Z, TALLY (D, can therefore be 
computed as follows: Set t ^ 0, p F(/); while not lit (p), set t f t +1 and p ^ F (p) ; 
set TALLY(/) ^ t] here c lit {p)’ stands for c p < 2n+2' The number of literals in clause c, 
SIZE(c) ， can be computed by a similar loop, using C cls (p)’ to stand for < m + 2n + 2 ’： 
Set t 0, p S (c); while not els {p), set t t + 1 and p ^r- S(p); set SIZE(c) ^ t. 
After initialization, the TALLY and SIZE statistics can be updated dynamically as local 
changes are made. (TALLY(/) and SIZE(c) can be maintained in L(0 and C(c)•) 

To facilitate resolution, the literals of each clause are required to increase from 
left to right; in other words, we must have L(p) < L(q) whenever p = S(q) and 
q = D(p), unless cls(p) or els (q). But the clauses within literal lists need not appear 
in any particular order. We might even have C(F(p)) > C(q) but CCFCp 7 )) < C(〆 ）， 
when C(p) = C(p’）and C(^) = C(q). 

To facilitate subsumption, each literal l is assigned a 64-bit signature SIG(D = 
(l 《 f/i) I (l 《 f/ 2 )，where U\ and U 2 are independently random 6-bit numbers- Then 
each clause c is assigned a signature that is the bitwise OR of the signatures of its 
literals: Set t l 0, p l S(c); while not ds (p\ set t ^ t \ SIG(L(p)) and p <- Sip ); 
set SIG(c) t. (See the discussion of Bloom’s superimposed coding in Section 6.5.) 

(a) To resolve c with c’，where c contains l and c contains 『， we essentially want 

to do a list merge- Set p q S(c), u L (q), q’ i— S(c)^ u l L(g’），and 

do the following while u + u >0: u = u\ copy(w) and bump(g 5 ^ / ); if u = u = Z ， 
bump(g,g / ); if u = u ^ terminate unsuccessfully; otherwise if u > v!, copy (it) 
and bump(^); otherwise copy( / u / ) and bump(^ / ). Here c copy( / u)^ means c set p p, 
p ^ AVAIL ， S(p’）I p ， L(p) ^ ‘bump(g)’ means ‘set q i- S(q); if cls(q) set 从 I 0, 
otherwise set u L(g )’； c bump(^ / )^ is similar，but it uses q and u \ and c bump(g ， g’)’ 
means c bump(g) and bump(g’)’. Unsuccessful termination occurs when clauses c and 
c resolve to a tautology; we set p 卜 0， after first returning cells p through S(l) to 
free storage if p ^ 1. Successful termination with u = u = 0 means that the resolved 
clause consists of the literals in cells from p through S(l), linked only via S pointers. 

(b) Find a literal l with minimum TALLY(0. Set p l F(D，and do the following 
while not lit(p): Set c C(p); if c’ — c and 〜 SIG(c’）& SIG(c) = 0 and SIZECc 7 ) > 
SIZE(c), do a detailed subsumption test; then set p F (p). The detailed test begins 
with q ^ Sic) ^ u ^ L(q) ^ q f S (c 7 ), u t L(gO, and does the following steps while 
u > u > 0: bump(^ / ) while u > u\ then bump(^^ / ) if u = u. When the loop 
terminates, c subsumes c if and only if u < u . 

(c) Use (b) but with (SIG(c) & 〜 SIG(D) in place of SIG(c) - Also modify the 
detailed test, by inserting c if u = l then u just after each occurrence oi^u ^ L(q) \ 

[The algorithm in (b) was introduced by A. Biere, LNCS 3542 (2005), 59-70, §4.2. 
“False hits，” in which the detailed test is performed but no actual (self-)subsumption 
is detected，tend to occur less than 1% of the time in practice.] 

375. Let each literal l have another field STAMP(D, initially zero; and let 5 be a global 
“time stamp” that is initially zero. To make the test，set 5 5 + 1 and 0; then 

set STAMP(w) s and a a \ SIG(u) for all u such that (lu) is a clause. If _ 0, 
set a <— a \ SIG(D and run through all clauses c that contain Z，doing the following: If 
SIG(c) & 〜 a = 0, and if each of c^s literals u ^ l has STAMP ( 从 ）= 5, exit with Ci = c 
and r = SIZE(c) — 1- If C\ has thereby been found, set 5 5 + 1 and STAMP (it) ^ 5 
for d\\ u ^ l in c. Then a clause (IW /3j) implicitly has j < r in the notation of exercise 
369 if and only if f3j is a single literal u with STAMP (ix) = 5- 

Given a variable test the condition first for I = x; if that fails，try l = x. 
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376. Highest priority is given to the common operations of unit conditioning and pure 
literal elimination, which are “low - hanging fruit.^ Give each variable x two new fields, 
STATE (x) and LINK(x). A “to—do stack，” containing all such easy pickings, begins at 
TODO and follows LINKs until reaching A. Variable x is on this stack only if STATE(x) is 
nonzero; the nonzero states are called FF (forced false )， FT (forced true), EQ (eliminated 
quietly), and ER (eliminated by resolution). 

Whenever a unit clause (Z) is detected, with STATE(|/|) = 0, we set STATE(|/|) - 
(/ & 1?FF:FT), LINK (|/|) 卜 T0D0, and TODO 卜 |Z|. But if STATE(|/|) = (Z & 1? FT: FF )， 
we terminate, because the clauses are unsatisfiable- 

Whenever a literal with TALLY(D = 0 is detected, we do the same thing if 
STATE(|/|) = 0. But if STATE(|/|) = (/ & 1?FT:FF)，we simply set STATE(|/|) ^ EQ 
instead of terminating. 

To clear the to-do stack, we do the following while TODO — A: Set x TODO and 
TODO 卜 LINK($); if STATE(x) = EQ, do nothing (no erp rule is needed to eliminate x)\ 
otherwise set l ^ (STATE (x) = FT? x: x)^ output the erp rule 111, and use the doubly 
linked lists to delete all clauses containing l and to delete l from all clauses- (Those 
deletions update TALLY and SIZE fields, so they often contribute new entries to the to- 
do stack. Notice that if clause c loses a literal, we must recompute SIG(c). If clause c 
disappears, we set SIZE(c) 卜 0, and never use c again.) 

Subsumption and strengthening are next in line- We give each clause c a new field 
LINK(c )， which is nonzero if and only if c appears on the “exploitation stack •” That 
stack begins at EXP and follows LINKs until reaching the nonzero sentinel value A 7 . All 
clauses are initially placed on the exploitation stack• Afterwards，whenever a literal l is 
deleted from a clause c, either during unit conditioning or self-subsumption, we test if 
LINK (c) = 0; if so, we put c back on the stack by setting LINK(c) EXP and EXP <— c. 

To clear the subsumption stack, we first clear the to-do stack. Then, while EXP ^ 
A 7 , we set c EXP，EXP ^ LBJK(c )， and do the following if SIZE(c) ^ 0: Remove 
clauses subsumed by c; clear the to-do stack; and if SIZE(c) is still nonzero, strengthen 
clauses that c can improve, clear the to - do stack，and set TIME(c ) 卜 T (see below). 

All of this takes place before we even think about the elimination of variables. But 
rounds of variable elimination form the “outer level” of computation. Each variable x 
has yet another field, STABLE (x) , which is nonzero if and only if we need not attempt to 
eliminate x. This field is initially zero, but set nonzero when STATE (x) *<— EQ or when 
an erp rule for x or x is output. It is reset to zero whenever a variable is later “touched,” 
namely when x or x appears in a deleted or self-subsumed clause. (In particular, every 
variable that appears in a new clause produced by resolution will be touched, because 
it will appear in at least one of the clauses that were replaced by new ones.) 

If a round has failed to eliminate any variables, or if it has eliminated them all ， 
we’re done. But otherwise there’s still work to do, because the new clauses can often 
be subsumed or strengthened. (Indeed，some of them might actually be duplicates.) 
Hence two more fields are introduced: TIME(D for each literal and TIME(c) for each 
clause, initially zero- Let T be the number of the current elimination round. We 
set TIME(/) T for all literals l in all clauses that are replaced by resolution, and 
TIME (c) ^ T is also set appropriately as mentioned above- 

Introduce yet another field, EXTRA (c) ， initially zero. It is reset to zero whenever 
TIME(c) l T 7 ，and set to 1 whenever c is replaced by a new clause. For every literal 
l such that STATE(|/|) = 0 and TIME(/) = T at the end of round T, set EXTRA(c) 
EXTRA(c) + 4 for all clauses c that contain Z，and EXTRA (c) ^ EXTRA (c) | 2 for all 
clauses c that contain I. Then run through all clauses c for which SIZE(c) >0 and 
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TIME(c) < T. If SIZE(c) = EXTRA (c) 》 2, remove clauses subsumed by c and clear the 
exploitation stack. Also，if EXTRA (c) & 3 _ 0， we may be able to use c to strengthen 
other clauses —unless EXTRA (c) & 1 = 0 and EXTRA (c) > 2 < SIZE(c) - l._ Self¬ 
subsumption using l need not be attempted when EXTRA (c) &1 = 0 unless TIME(0 = T 
and EXTRA(c) > 2 = SIZE(c) - [TIME ⑴ =T]. Finally, reset EXTRA(c) to zero (even if 
TIME(c) =T). [See Niklas Een and Armin Biere, LNCS 3569 (2005), 61-75.] 

377. Each vertex v of G corresponds to variables vi, V 2 ^ V 3 in F; each edge u —— v 
corresponds to clauses (ui V 仍 ), (u2 V V3), (U3 V v\)^ (U2 V v\)^ (U3 V 62 ), (^1 V 1 ) 3 ) - The 
longest paths in the dependency digraph for F have the form ti U 2 — > ^3 — or 
ti U 3 ^ V 2 wi^ where t —— u —— v —— w is a walk in G. 

[A similar method reduces the question of finding an oriented cycle of length r in a 
given digraph to the question of finding a failed literal in some dependency digraph. The 
cycle detection problem has a long history; see N. Alon, R. Yuster, and U- Zwick, Algo- 
rithmica 17 (1997), 209-223. So any surprisingly fast algorithm to decide whether or 
not failed literals exist —— that is，faster than n 2u; /( CJ + 1 ) when m = O(n) and matrix mul¬ 
tiplication takes O(n^) —— would lead to surprisingly fast algorithms for other problems*] 

378. The erp rule l *<— Z V (『1 八 … 八乙 ) will change any solution of F\C into a solution 
of F. [See M. Jarvisalo, A. Biere, and M. Heule, LNCS 6015 (2010), 129-144.] 

(In practice it’s sometimes possible to remove tens of thousands of blocked clauses. 
For example, all of the exclusion clauses ( 17 ) in the coloring problem are blocked, as 
are many of the clauses that arise in fault testing. Yet the author has yet to see 
a single example where blocked clause elimination is actually helpful in combination 
with transformations 1—4, which are already quite powerful by themselves.) 

379. (Solution by O. Kullmann.) In general, any set F of clauses can be replaced by 
another set F,, whenever there’s a variable x such that the elimination of x from F 
yields exactly the same clauses as the elimination of x from F r . In this case the elimi¬ 
nation of a has this property- The erp rule a 卜 a V (bAcAd) is necessary and sufficient- 

380 • (a) Reverse self-subsumption weakens it to (a V 6 V c V d) ， then to (a V 6 V c V d V e) ， 
which is subsumed by (aWdWe). [In general one can show that reverse self-subsumption 
from C leads to a subsumed clause if and only if C is certifiable from the other clauses-] 

(b) Again we weaken to (a V 6 V c V d V e); but now we find this blocked by c. 

(c) No erp rule is needed in (a)，but we need c 4 - c V (a A 6 ) in (b). [Heule, 
Jarvisalo, and Biere ，LNCS 6397 (2010), 357—371，call this “asymmetric elimination.’’] 


381. By symmetry ， we’ll remove the final clause- (Without it, the given clauses state 
that xi < X 2 < • • • < x n ] with it, they state that all variables are equal-) Assume more 
generally that，for 1 < j < n, every clause other than {xj V Xj+i) that contains Xj 
also contains either x n or Xi for some i < j. For 1 < j < n — 1 we can then weaken 

V • • • V x j+ i V 


x n )- Finally, (xi V • • • V x n -i V x n ) can be 


{xi V • • • V Xj V x n ) to {xi 


eliminated because it is blocked by x n -i. 

Although weVe eliminated only one clause, n — 1 erp rules are actually needed 
to undo the process: xi <— xi V x n ; X 2 X 2 V (xi A x n ); ^3 ^3 V {x\ A X 2 A x n )] •…； 

Xn-i Xn-iW (xi A - - • Ax n _2 Ax n ). (Those rules, applied in reverse order, can however 
be simplified to Xj Xj V x n for 1 < j < n, because xi < — • < x n in any solution.) 
[See Heule, Jarvisalo, Biere, EasyChair Proc. in Computing 13 (2013), 41-46.] 

382. See M. J. H. Heule, M. Jarvisalo, and A. Biere, LNCS 6695 (2011), 201-215. 

383. (a) In a learning step，let ^ ^ and = ^UC. In a forgetting step, let $ 

and ^ ^ U C. In a hardening step, let = $ U (7 and ^ ^ U C. In a softening 
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step，let $ = U (7 and ^ U C. In all four cases it is easy to verify that (sat ( 企 ) 
sat($ U 少 ))implies (sat($) 4=^ sat($’) 4=^ sat($’ U 屯 ’))，where sat(G f ) means 
U G is satisfiable，” because sat(G U G r ) => sat(G) - Thus the assertions are invariant- 

(b) Each erp rule allows us to go one step backward, until reaching F. 

(c) The first (softening) step is fine, because both $ = (x) and $ \ (x) = 1 are 

satisfiable, and because the erp rule unconditionally makes x true* But the second 
(learning) step is flawed, because sat (伞 U 屯 ) does not imply sat (伞 U ^ U (7) when 
$ U ^ = (x) and C = (x). (This example explains why the criterion for learning is not 
simply c sat($) sat($ U (7)’ as it essentially is for softening.) 

(d) Yes，because C is also certifiable for $ U 少 . 

(e) Yes，after softening it. No erp rule is needed，because $ \ (7 h C. 

(f) A soft clause can be discarded whether or not it is subsumed- To discard a 
hard clause that is subsumed by a soft clause, first harden the soft one- To discard a 
hard C that is subsumed by a hard C\ weaken C and then discard it. (The weakening 
step is clearly permissible, and no erp rule is needed.) 

(g) If C contains x and (7’ contains x and C \ x C C r \ we can learn the soft 
clause C oC f = C’ \ x, then use it to subsume C r as in (f). 

(h) Forget all soft clauses that contain x or x. Then let Ci^ ^ C p be the hard 
clauses containing and C、, • • • ， C r q those containing x. Learn all the (soft) clauses 
Ci oC f j, and harden them, noting that they don’t involve x. Weaken each C“ with erp 
rule x xWCi^ and forget it; also weaken and forget each C’j, with erp rule x ^ xACj. 
(One can show that either of the erp rules in ( 161 ) would also suffice.) 

(i) Whenever $U 少 is satisfiable, so is $U 屯 U {(xVz) ， (y\/z)^ (x\/yWz)}^ because 
we can always set z x\/ y. 

[Reference: M. Jarvisalo, M. Heule, and A. Biere, LNCS 7364 (2012), 355-370. 
Notice that, by exercise 368, parts (f) and (h) justify the use of unit conditioning.] 

384. Whenever we have a solution to ^ \ C that falsifies (7, we will show that $ is 
satisfied by making l true; hence softening C is permissible, with erp rule l <— l W C. 

To prove that claim, notice that a problem could arise only in a hard clause C r 
that contains l. But if all other literals of C r are false in the given solution, then all 
literals of C oC f are false，contradicting the assumption that (^\C) A C oC f hi e. 

(Such clauses C are “resolution certifiable” with respect to ^\C. Blocked clauses 
are a very special case. Similarly, we can safely learn any clause that is resolution 
certifiable with respect to $ U 屯 •） 

385. (a) True, because C Al hi e. 

(b) 1 is implied，not certifiable; 12 is certifiable, not absorbed; 123 is absorbed. 

(c ， d) If (7 is any clause and l is any literal, then F AC hi l implies F A (7 hi 
because unit propagation in F carries over to unit propagation in F f • 

386. (a) The trail contained exactly score(F, l) literals when decision l was made at 
level d. The clause learned from the ensuing conflict causes at least one new literal to 
be implied at level d! < d. 

(b) The score can’t decrease when F grows. 

(c) Each l G C needs at most n helpful rounds to make score(F, (7,/) = oo. 

(d) Suppose, for example, F = (a V J) A (a V 6 V e V /) A (a V c) A (6) A (c V c? V e V /) 

and C = (a V 6 V c V d V /)• The helpful sequences of decisions are ( 乙， 『)， 

(d,d,cj), and they occur with probabilities To^h To 
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In general if a decision is to be made and j elements of C are not yet in the trail, 
the probability that a suitable decision will be made at random is at least 

/(n,j) = min^^-i/(n-l,j- 1 ), ^^y/(n- 2 ,j- 2 ), 

_ _ _i) _1_^ — Ij.rll' 

2 (n—j+ 2 ) An J 十， 2(n-j + l)J ~ 23ni ' 

(e) The waiting time to absorb each clause Ci is a geometric distribution whose 
mean is < Av) Ci \ repeated at most \Ci\n times. 

References: K. Pipatsrisawat and A. Darwiche, Artif. Intell. 175 (2011) ， 512-525; 
A. Atserias ， J. K. Fichte, and M. Thurley, J. Artif. Intell. Research 40 (2011), 353-373. 

387. We may assume that G and G r have no isolated vertices. Letting variable vv 

mean that v corresponds to we need the clauses {uv r V vv r ) for u < v and {yu r V vv r ) 

for u < v r . Also, for each u < v with u —— v in we introduce auxiliary variables 

uu vv for each edge u —— v in G f , with clauses {uu r vv r V uu) A (uu f vv f V vv) A 

(vw vv I u —— v in G f }). The variables vv and uu vv can be restricted to cases 

where degree(?x) < degree ( 乜 ’）and degree(v) < degree (i； 7 ). 

388. (a) Can the complete graph Kk be embedded in G? (b) Can G be embedded in 
the complete A:-partite graph K n n , where G has n vertices? (c) Can the cycle C n be 
embedded in G? 

389. This is a graph embedding problem，with G r the 4x4 (king U knight) graph and 
with G defined by edges T —— H, H —— E, …， N —— G. The adjacent Ms can be avoided 
by changing ‘PROGRA 丽 ING’ to either ‘PROGRAMXING’ or ‘PROGRAXMING’. 

Algorithm C needs less than 10 megamems to find the first solution below. 
Furthermore, if the blank space can also be moved, the algorithm will rather quickly 
also find solutions with just five knight moves (the minimum)，or 17 of them (the max): 
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390 . Let d 、 u,v) be the distance between vertices u and v. Then d(v, v) = 0 and 

d 、 u,v) < j + 1 d{u^ v) < j or d(u^ w) < j for some w G N(v) = {w \ w —— v}. (*) 

In parts (a) ，（ d)，we introduce variables Vj for each vertex v and 0 < j < k. In part (c) 
we do this for 0 < j < n. But parts (b) ， (e )， （ f) use just n variables, {v \ v £ V}. 

(a) Clauses (5 0 ) 八 \J veV \ s (v 0 ) A \J veV (v j+ i\ / vj V \/ w _^ wj) are satisfied only if 
Vj < [d(s^ v) < j]; hence the additional clause (tk) is also satisfied only if d(s^t) < k. 
Conversely，if d(s^ t) < all clauses are satisfied by setting Vj <— [d(s^ v) < j]. 

(b) There’s a path from 5 to t if and only if there’s a subset H C V such that 

s G H, t & H , and every other vertex of the induced graph G | H has degree 0 or 2. 
[The vertices on a shortest path from s to t yield one such H. Conversely, given H, we 
can find vertices Vj G H such that s = vo —— vi - - - Vk = t.] 

We can represent that criterion via clauses on the binary variables v = [v ^ H] 
by asserting (s) A ⑷， together with clauses to ensure that S(s) = S ⑷ = 1， and that 
S(i；) G {0, 2} for all v G H\{s^ t}^ where ^(v) = w IS degree of v in G\H. 

The number of such clauses for each v is at most 6|iV(r)|，because we can append v 


geometric distribution 

Pipatsrisawat 

Darwiche 

Atserias 

Fichte 

Thurley 

isolated vertices 
auxiliary variables 
complete graph 
complete A；-partite graph 
cycle 

graph embedding 
distance d{u^ v) in a graph 
induced graph 
shortest path 


September 23, 2015 






7.2.2.2 


ANSWERS TO EXERCISES 


263 


to each clause of ( 18 ) and (lg) when r = 2, and |7V(v)| additional clauses will rule out 
S(v) < 2. Altogether there are 0{m) clauses，because |A^(v)| = 2m. 

[Similar but simpler alternatives, such as (i) to require ^(v) G {0,2} for all v G 
V\{s ， t}^ or (ii) to require D(v) > 2 for all v G H\{s^ t}^ do not work: Counterexamples 
are (i) sa^ot and (ii) scx^ot. Another solution，more cumbersome, associates a Boolean 
variable with each edge of G] 

(c) Let s be any vertex; use (a)，plus (v n -i) for sl\ v ^ V \ s. 

(d) Clauses (so)A\J^~q \/ veV \ / weN ^ v) (vk\/ Wk+i) are satisfied only if we have vj > 
[d(s^ v) < j]; hence the additional clause (tk) cannot also be satisfied when d(s^ t) < k. 
Conversely, if d(s ， t) > k we can set Vj [d(s,v) < j]. 

⑷ (5) A (VvGvVwGNiv)^ V W )) A ©• 

(f) Letting 5 be any vertex, use (s) A (\/ v£V \/ w£N(v) (v W w)) A (\/ v£V \ s v). 

[Similar constructions work with digraphs and strong connectivity- Parts (d)-(f) 
of this exercise were suggested by Marijn Heule. Notice that parts (a) and (c)-(f) 
construct renamed Horn clauses, which work very efficiently (see exercise 444 )」 

391. (a) Let d — 1 = (qi-i - - . ^ 0)2 - To ensure that (xi-i .. • 3 : 0)2 < d we need the 
clauses (xiW \J{xj \ j > i, qj = 1}) whenever qi = 0. The same holds for y. 

To enforce x ♦ introduce the clause (a/—i V … V ao) in auxiliary variables 
a/-i •••(!()，together with (dj V Xj W yj) A {aj V Xj V yj) for 0 < j < / (see ( 172 )). 

(b) Now x ^ y is enforced via clauses of length 21^ which state that we don’t have 
x = y = k for 0 < k < d. For example, the appropriate clause when l = 3 and k = 5 is 
(x 2 V y 2 W xi W yi W x 0 W y 0 ). 

(c) Use the clauses of (b) for 0 < A: < 2d — 2^ plus clauses of length 21 — 2 
for d < k < 2 l stating that we don’t have (x/_i ... ^1)2 = {yi 一 1 - - - yi )2 = k. (The 
encodings in (b) and (c) are identical when d = 2 l .) 

[See A. Van Gelder, Discrete Applied Mathematics 156 (2008), 230-243.] 

392. (a) [Puzzle (ii) was introduced by Sam Loyd in the Boston Herald^ 13 November 
1904; page 27 of his Cyclopedia (1914) states that he’d created a puzzle like (i) at age 9! 
Puzzle {iv) is by H. E, Dudeney ， Strand 42 (1911) ， 108, slightly modified- Puzzle (in) 
is from the Grabarchuks^ Big, Big，Big Book of Brainteasers (2011) ， #196; puzzle (v) 
was designed by Serhiy A. Grabarchuk in 2015.] 
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(b) [Puzzle (vi) is an instance of the odd-even transposition sort，exercise 5.3.4- 
37 - Eight order-reversing connections would be impossible with only eight columns, 
instead of the nine in (vii ) ， because the permutation has too many inversions.] 
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(c) Let dj = ^2 J i=1 (\Ti\ — l) and d = dt. We introduce variables Vi for 1 < i < 
and the following clauses for 1 < j < t and dj-i < i < dj ： V Vi) for 1 < % < dj-i] 
the clauses of answer 390(b) on variables where s is the (i — dj-i)th element of Tj 
and t is the last element. These clauses ensure that the sets Vj = {v \ Vd j _ 1 +iy * - - V Vd 5 } 
are disjoint，and that Vj contains a connected component Sj D Tj. 

We also assert (vi) for 1 < i < whenever Tj is a singleton set {v}. 

[For the more general “Steiner tree packing” problem，see ML Grotschel, A. Mar¬ 
tin, and R. Weismantel, Math. Programming 78 (1997), 265—281.] 

393. A construction somewhat like that of answer 392(c) can be used with 
five different 8 x 8 graphs, one for the moves of each white-black pair Sj. 

But we need to keep track of the edges used, not vertices, in order to 
prohibit edges that cross each other* Additional clauses will rule that out. 

394. Call these clauses langford’" （ n). [Steven Prestwich described a similar method 
in Trends in Constraint Programming (Wiley ， 2007) ， 269-274.] Typical results are: 


variables clauses Algorithm D Algorithm L Algorithm C 


langford 1 

"⑼ 

206 

1157 

131 M^u 

18Mfi 

22Mfi 

(UNSAT) 

langford nf (13) 

403 

2935 

1425 G" 

44 Gfi 

483 G" 

(UNSAT) 

langford" 1 

(16) 

584 

4859 

713 Kp 

42 Mfi 

343 Kfi 

(SAT) 

langford’" 

(64) 7352 

120035 

(huge) 

(big) 

71 

(SAT) 


395. The color of each vertex v gets binary axiom clauses (v^ +1 W) for 1 < j < d—1, as 
in ( 164 ) • And for each edge u —— v in the graph, we want d clauses (W 1 V W 护 一 1 W) 
for I < j < omitting u° and v° when j = 1 , u d and v d when j = d. 

[The surprising usefulness of order encoding in graph coloring was first noticed by 
N. Tamura, A. Taga, S. Kitagawa, and M. Banbara in Constraints 14 (2009), 254-272.] 

396. First we have ( 元 J+1 Vx J ) and (£J +1 Vx J ) for 1 < j < d. Then we have “channeling” 
clauses to ensure that j < x < j + 1 4=^ jir < m + 1 for 0 < j < d: 

V x J+1 V x J7r ) A (x J V x J+1 V x^ +1 ) A V x J7r+1 V x J ) A V x^ +l V 

(These clauses should be either shortened or omitted in boundary cases，because x° and 
x° are always true，while x d and x d are always false. We obtain 6d—8 clauses for each x.) 

With such clauses for every vertex of a graph, together with clauses based on 
adjacent vertices and cliques, we obtain encodings for n-coloring the nxn queen graph 
that involve 2(n 3 — n 2 ) variables and |n 4 + 4n 3 + 0(n 2 ) clauses, compared to n 3 — n 2 
variables and |n 4 — n 3 + 0(n 2 ) clauses with single cliques and ( 162 ) alone. Typical 
running times with Algorithm C and single cliques are 323 K//, 13.1 M//, 706 Gfi for 
n = 7, 8 , 9; with double clique-ing they become 252 K//, 1.97 39.8 G/i, respectively- 

The double clique hints turn out to be mysteriously ineffective when it is the stan¬ 
dard organ-pipe permutation (07r ， l7r， ••” (d—l) 7 r) = (0, 2,4, • •. ， 5, 3, 1) instead of its 
inverse. Random choices of 7 r when n = 8 yielded significant improvement almost half 
the time, in the author’s experiments; but they had negligible effect in 1/3 of the cases. 

Notice that the example 7r for d = A yields x 1 = xo^ x 3 = X 3 ^ x 1 = x‘ 2 , 沄 3 = 
Hence the direct encoding is essentially present as part of this redundant representation, 
and the hints ( 公 3 V5 3 ) A (u 1 Vv 1 ) V (u 3 W v 3 ) A (u 1 Wv 1 ) for 2-cliques {u^v} are equivalent 
to ( 16 ). But the hints (u 2 V t ; 2 V w 2 ) A (u 2 V v 2 V w 2 ) A (u 2 \/ v 2 \/ w 2 ) A (u 2 V v 2 V w 2 ) 
that apply when {u^ w} is a triangle give additional logical power. 
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397. There are (p — 2)d binary clauses ( 匁广 1 V y] ) for 1 < i < p — 1, together with the 

(2p — 2)d clauses (xj V x{ +l V y]) A V x^l V y]) for 1 < i all for 0 < j < d. 

The hint clauses (a^- 1 V ... V x p p Z\) A (xq~ p+1 V ... V x^Zi +1 ) are also valid. 

(This setup corresponds to putting p pigeons into d holes, so we can usually 
assume that p < d. If p < 4 it is better to use (f 2 )d clauses as in exercise 395 - Notice 
that we obtain an interesting representation of permutations when p = d. In that case 
y is the inverse permutation; hence (2d — 2)p additional clauses corresponding to yj = i 
Xi = j are also valid, as well as two hint clauses for y.) 

A related idea, but with direct encoding of the x’s, was presented by L Gent 
and P- Nightingale in Proceedings of the International Workshop on Modelling and 

Reformulating Constraint Satisfaction Problems 3 (2004) ， 95-110. 

♦ 

398. We could construct (3p — 4)d binary clauses that involve as in exercise 397. 
But it’s better just to have (3p — 6)d clauses for the at - most-one constraints ^o/c + 
• • • + ^(p—i)k S 0 ^ k 〈 d. 

399. (a) d 2 —t preclusion clauses (binary); or 2d support clauses (total length 2{d+t)). 

(b) If unit propagation derives Vj from {vn V 巧)， we knew Ui; hence (17) gives 
for all i’ # i，and Vj follows from the support clause that contains it. 

(c) If unit propagation derives Vj from its support clause, we knew U{ for all i ^ j; 
hence (15) gives Uj, and Vj follows from (16) - Or if unit propagation derives Ui from that 
support clause, we knew Vj and for all x ^ {hj}] hence Uj from (16), Ui from (15). 

(d) A trivial example has no legal pairs; then unit propagation never gets started 
from binary preclusions, but the (unit) support clauses deduce all- A more realistic 
example has d = 3 and all pairs legal except (1 ， 1) and (1,2) ， say; then we have 
(15) 八 (17) A (uiW vi) A (ui V V2) A (v 3 ) l/i ui but (15) A (17) A {u\ V V3) A (^ 3 ) l~i u\. 

[Preclusion was introduced by S- W. Golomb and L. D. Baumert ，JACM 12 
(1965), 521-523. The support encoding was introduced by L R Gent, European Conf. 
on Artificial Intelligence 15 (2002) ， 121—125，based on work of S. Kasif ，Artificial 
Intelligence 45 (1990), 275-286.] 

400. This problem has n variables qi, • • • ， q n with n values each; thus there are n 2 

Boolean values, with qij = [qi =j] = [there’s a queen in row i and column j]. The 
constraint between qi and qj is that qi ^ {qj，qj + i — — i + j}] so it turns out that 

there are n at-least-one clauses, plus (n 3 —n 2 )/2 at-most-one clauses, plus either n 3 —n 2 
support clauses or n 3 — n 2 + Q) preclusion clauses. In this problem each support clause 
has at least n — 2 literals, so the support encoding is much larger. 

Since the problem is easily satisfiable, it makes sense to try Walks AT. When 
n = 20, Algorithm W typically finds a solution from the preclusion clauses after making 
fewer than 500 flips; its running time is about 500 K#, including about 200 Kfi just to 
read the input. With the support clauses, however, it needs about 10 times as many 
flips and consumes about 20 times as many mems，before succeeding. 

Algorithm L is significantly worse: It consumes 50 Mfi with preclusion clauses, 
11 Gfi with support clauses. Algorithm C is the winner, with about 400 K/i (preclusion) 
versus 600 Kfi (support)- 

Of course n = 20 is pretty tame; let’s consider n = 100 queens, when there are 
10,000 variables and more than a million clauses- Algorithm L is out of the picture; 
in the author’s experiments, it showed no indication of being even close to a solution 
after 20 T"! But Algorithm W solves that problem in 50 M#，via preclusion, after 
making only about 5000 flips* Algorithm C wins again, polishing it off in 29 Mfi. With 
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the support clauses, nearly 100 million literals need to be input, and Algorithm W is 
hopelessly inefficient; but Algorithm C is able to finish after about 200 M/x. 

The preclusion clauses actually allow us to omit the at - most-one clauses in this 
problem, because two queens in the same row will be ruled out anyway- This trick 
improves the run time when n = 100 to 35 M/x for Algorithm W- 

We can also append support clauses for the columns as well as the rows. This idea 
roughly halves the search space, but it gives no improvement because twice as many 
clauses must be handled. Bottom line: Support clauses don’t support n queens well- 
(However, if we seek all solutions to the n queens problem instead of stopping 
with the first one，using a straightforward extension of Algorithm D (see exercise 122), 
the support clauses proved to be definitely better in the author’s experiments.) 

401. (a) (b) In general w = \_{x + a)/b\ 4=^ = x b ^~ a . 


402. (a) A}=i 2J V x 2j ); (b) 八 以 21 (x 2j ^ 2 V x 2 ^ 1 ); omit x° and x d . 


403. (a) f\^~l(x J \/y J \/z j ); (b) W z j ) A(y J W z j )); (c) A ^ ( ( xJ V ^) A (y j V ^)); 

⑷ /\^11 (x j V y j V z j ). 

404. (a) 八 ，二 Vf +1-a V" J + a ). (As usual，omit literals with superscripts < 0 
or > d. If a > 1 this encoding is unsymmetrical, with one clause for each value of x.) 

(b) 八 ，二 V P V y j+a ) A (p V x^ +a V y j )); p is the auxiliary variable. 


405. (a) If a < 0 we can replace ax by (—a)x and c by c + a — ad, where x is given 
by ( 165 ) - A similar reduction applies if 6 < 0. Cases with a ， 6 , or c = 0 are trivial- 

(b) We have 13x + 8 y < 63 not 13x + 8 ^ > 64 not (Po or • • • or 
Pd-i) not Po and •… and not Pd 一 1, where Pj = c x > j and y >「(64 — 13j)/8~\\ 
This approach yields Aj=o(^ ^ 2 / 8 —「( 64 — 13 J)/ 8 1), which simplifies to (x 1 W y 1 ) A (x 2 W 

y s ) A (x s V y 4 ) A (x A V y 6 ) A (x°). (Notice that we could have defined Pj = c y > j 
and x >「(64 — 8j)/13]^ instead，thereby obtaining the less efficient encoding (x b ) A 
(y 7 V x 5 ) A (y 6 V x 4 ) A (y 5 V x 4 ) A (y 4 V x s ) A (y 3 V x 2 ) A (y 2 V x 2 ) A (y 1 V x 1 ); ifs better 
to discriminate on the variable with the larger coefficient-) 

(c) Similarly, 13x-\-Sy < 90 gives (x 5 V y 7 ) A (x 4 V y Q ) A (x 3 V y 4 ) A (x 2 V y 2 ) A (x 1 ). 
(The (x ， y) pairs legal for both (b) and (c) are (1,1), (2, 3) ， (3,4) ， (4,6)\) 

⑷八 ;= 二 : 「「 ㈡ ⑵ / (c+ 1 - 幻 )/61 )， when a > 6 > 0 and c > 0 . 

406 • ⑷ ( 〜「 ^^/(“”( ，十 1 )" 1 〜.)). 

(b) (A}S J+1 (^ V ^Ua-D/O-DJ+i)) A ( A r^ri (x L(-D/0-i)J+i v ^)) A 

(x l ) A (y 1 、, where l = [(a — l)/(d — 1)J + 1. [Both formulas belong to 2 SAT.] 

407. (a) We always have [x/2\ -f \x/2] = [x/2\ -f- [y/^\ < < [x/2\ -f- [y/^\ + 1, 

and \x/2] + \y/2] —1 < < \x/2] + \y/^]- (Similar reasoning proves the correctness 

of Batcher’s odd-even merge network; see Eq. 5.3.4—( 3 ).) 

(b) Axiom clauses like ( 164 ) needn’t be introduced for u and or even for 2：; so 
they aren’t counted here, although they could be added if desired. Let ad = d 2 — 1 be 
the number of clauses in the original method; then the new method has fewer clauses 
when arw] + a^/ 2 j+i + 3(d — 2) < a 山 namely when d > 7. (The new method 
for d = 7 involves 45 clauses, not 48; but it introduces 10 new auxiliary variables.) 
Asymptotically, we can handle d = 2 ^ + 1 with 3t2 f + 0(2^) = 3d\gd 0(d) clauses 
and d\gd-\- O(d) auxiliary variables. 

(c) x-\-y > z 4=^ {d — l — x) + {d — l — y) < (2d — 2 — z); so we can use the same 
method, but complemented (namely with x J x d ~ J \ ^ y d ~\ »-)* z 2d ~ l ~ j ). 
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[See N* Tamura, A. Taga ， S. Kitagawa, and M. Banbara, Constraints 14 (2009), 
254-272; R. Asm, R. Nieuwenhuis, A. Oliveras, and E. Rodriguez-Carbonell, Con¬ 
straints 16 (2011), 195-221.] 

408. (a) No; makespan 11 is best，achievable as follows (or via left-right reflection): 


Ml 


M2 


M3 




J1 


J3 



J3 


J2 



J1 


J2 


J3 


J1 



Ml 


M2 


M3 


J3 


J1 







J2 


J3 

J1 


J1 


J3 


J2 



(b) If j is the last job processed by machine that machine must finish at time 
< ELi ^ + Wkj — Wij, because j uses some other machine whenever i is idle. 
[See DT B. Shmoys7C. Stein, and J. Wein, SICOMP 23 (1994), 631.] 

(c) Clearly 0 < Sij < t — Wij. And if ij ^ i’ j’ but i = i’ ot j = j 、 we must have 
either Sij + Wij < Si ， j，or Si ， j ， + y < Sij whenever WijW^jf ^ 0. 

(d) When Wij >0， introduce Boolean variables s^j ior 1 < h < t — Wij, with 
the axiom clauses (s^ 1 V sfj) for 1 < A: < t — Wij. Then include the following clauses 
for all relevant j, i’，and j f as in (c): For 0<A:<t + l — Wij — y , assert 

(piji'j' V Sij V sl^ ij ) if ij < if or (pi'j'ij V V if ij > i j', omitting 卷 in 

the first of these ternary clauses and omitting U?，J， in the last- 

J 

[This method, introduced by N. Tamura, A. Taga, S. Kitagawa, and M. Banbara 
in Constraints 14 (2009) ， 254—272，was able to solve several open shop scheduling 
problems in 2008 that had resisted attacks by all other approaches.] 

Since the left-right reflection of any valid schedule is also valid, we can also save 
a factor of two by arbitrarily choosing one of the p variables and asserting (Piji ， j，）. 

(e) Any schedule for W and T yields a schedule for and \T/k^ if we 

examine time slots 0 ， A :， 2k, • • • • [With this observation we can narrow down the 
search for an optimum makespan by first working with simpler problems; the number 
of variables and clauses for 众 」 and T/k is about 1/k times the number for W 
and T, and the running time also tends to obey this ratio- For example, the author 
solved a nontrivial 8x8 problem by first working with |_^78」and getting the respective 
results (U ， S ， U) for t = (128, 130, 129)，where means “unsatisfiable” and means 
“satisfiable” ； running times were about (75, 10, 1250) megamems- Then with [ 只 "4」 it 
was (S ， U ， U) with t = (262, 260, 261) and runtimes (425, 275, 325); with [^/2J it was 
(U, S,U) with t = (526, 528,527) and runtimes (975, 200,900). Finally with the full W 
it was (U, S ， S) with t = (1058,1060,1059) and runtimes (2050, 775, 300), establishing 
1059 as the optimum makespan while doing most of the work on small subproblems.] 

Notes: Further savings are possible by noting that any clauses learned while 
proving that t is satisfiable are valid also when t is decreased. Difficult random problems 
can be generated by using the following method suggested by C. Gueret and C. Prins 
in Annals of Operations Research 92 (1999), 165-183: Start with work times Wij that 
are as near equal as possible, having constant row and column sums s. Then choose 
random rows i ♦ i and random columns j ^ /， and transfer S units of weight by 
setting Wij Wij — J ， w^j w^j +J ，Wiy +J, Wi，y — J, where S > Wij 

and S > this operation clearly preserves the row and column sums. Choose S at 

random between p - min{wij and min{w ； ij ， 卿 / }, where p is a parameter. The 

final weights are obtained after making r such transfers. Gueret and Prins suggested 

o 

choosing r = n , and p = -95 for n > 6; but other choices give useful benchmarks too. 
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409. (a) If 5 C {l ， ... ， r}，let S5 = a r 'Ve can assume that job n runs on 

machines 1 ， 2, 3 in that order. So the minimum makespan is 2w2 n + where x is the 
smallest Ss that is >「(ai + • • • + a r )/2]. (The problem of finding such an S is well 
known to be NP-hard [R. ML Karp, Complexity of Computer Computations (New York: 
Plenum, 1972) ， 97-100]; hence the open shop scheduling problem is NP-complete.) 

(b) Makespan W 2 n + u> 4 n is achievable if and only if H5 = (ai + • • • + a r )/2 for 
some Otherwise we can achieve makespan W 2 n + W4 n + 1 by running jobs 1， • • •, n 
in order on machine 1 and letting s 3 ( n — 丄 ） = 0, sau = W 2 n] also S 2 j = W 2 n + w ； 4n, if 
machine 1 is running job j at time W 2 n. The other jobs are easily scheduled. 

(c) L3n/2 」一 2 time slots are clearly necessary and sufficient. (If all row and 
column sums of W are equal to 5, can the minimum makespan be > |s?) 

(d) The “tight” makespan s is always achievable: By renumbering the jobs we 
can assume that aj < bj for 1 < j < aj > bj for k < j < bi = max{6i, ••” bk }， 
a n = max{a；c+i， • • •, a n }- Then if b n > ai，machine 1 can run jobs (1， • • • ， rz) in order 
while machine 2 runs (n ， 1, …， n — 1); otherwise (2, …， n, 1) and (1， … n) suffice. 

If ai + • • • + a n — 61 + ••• + 6 n ，we can increase a n or b n to make them equal. 
Then we can add a “dummy” job with a n +i = 6 n +i = maxjai + 61 ， … ， a n + 6 n } 二 s, 
and obtain an optimum schedule in 0{n) steps as explained above. 

Results (a) ，（ b) ， (d) are due to T. Gonzalez and S- Sahni，who introduced and 
named the open shop scheduling problem in JACM 23 (1976) ， 665-679. Part (c) is a 
subsequent observation and open problem due to Gonzalez (unpublished). 

410. Using half adders and full adders as we did in (23) allows us to introduce interme¬ 

diate variables Wj such that (x2XiXo)2 + (x2^i^o00)2 + (x2^i^o000)2 + (沒 2 沒 i 沒 oOOO)2 < 
(W7W6 … 詢 )2, and then to require (咖）八㈤ 6). In slow motion, we successively compute 
(co 2:0)2 > Xo + Xi, (Cl 2 ： 1)2 > Xo + Xi + 豆 0, (C‘2》2)2 > Cq+ (C 3 2 ： 3)2 > Xi + + 豆 1 ， 

(042:4)2 > Cl +C2 (052:5)2 > X 2 + V 2 +C 3 , (c 6 2： 6 )2 > C4 + (072:7)2 > c 5 +c 6 ； then 

W7W6 • • • = C7 Z7Z 6 Z 4 Z 2 Zoxixo- In slower motion，each step [c{Zi)2 > u + v expands 

to Zi 2 乜㊉ V， Ci > u Av; each step [c{Zi)2 > t + u + v expands to si 2 t ㊉ u, pi > t Au^ 
Zi 2 v ㊉ s， ％ > v A Ci > pi \/ qi. And at the clause level ，t > u A v 4=^ (t\/ u\/ v)] 
t >uW v (t\/ u) A(t\/ v); t > u®v 4=^ (t\/ u\/ v) A(t\/ u\/ v). [Only about half of 
(24) is needed when inequalities replace equalities. Exercise 42 offers improvements.] 

We end up with 44 binary and ternary clauses; 10 of them can be omitted，because 
么 0, 之 2, 之 4, 之 6, and Z 7 are pure literals，and the clause for 07 can be omitted if we simply 
require C5 = C6 = 0. But the order encoding of exercise 405 is clearly much better. The 
log encoding becomes attractive only with larger integers, as in the following exercise- 
[See J- P- Warners, Information Processing Letters 68 (1998), 63-69-] 

411. Use m + n new variables to represent an auxiliary number w = (w m -\- n ... wi) 2 - 
Form clauses as in exercise 41 for the product xy = w] but retain only about half of 
the clauses, as in answer 410. The resulting 9mn — 5m — lOn clauses are satisfiable 
if w = xy] and we have w > xy whenever they are satisfiable. Now add 3m + 3n — 2 
further clauses as in (169) to ensure that z > w. The case z < xy is similar• 

412. Mixed-radix representations are also of interest in this connection. See, for 
example, N. Een and N. Sorensson, J. Satisfiability^ Bool. Modeling and Comp. 2 

(2006), 1-26; T. Tanjo, N. Tamura, and M. Banbara, LNCS 7317 (2012), 456-462. 

413• Eliminating first a n _i, then a n -‘2, etc” yields 2 n — 1 clauses. (The analogous result 
for xi .. .x n < yi • • - y n is 2 n + 2 n-1 + !• A preprocessor will probably eliminate a n _i.) 
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414. Construct clauses for 1 < A: < n that represent c a/c_i implies Xk < yk + a。' 


^k -1 v V (x J k V y 3 k )^j A (h Va k V \/ (x J k V ?/i +1 )) , omitting x° k and y d k \ 


also omit do. For the relation xi ... x n < yi … y n we can omit the d clauses that 
contain the (pure) literal a n . But for xi ... x n < yi … y n , 观 want a n = 0; so we omit 

籲 • 

a n and the d — 1 clauses (a n _i M x 3 n W y J n ). 

415. There’s only one, namely Aai,...,a n Vcril/i V. • • Vcr n a: n Va n ?/ n ). Proof: 
Some clause must contain only positive literals，because /(0, ••• ，（ ) ） =0. This clause 
must be (xiW yiW • • • V V ?/ n ); otherwise it would be false in cases where / is true- 
A similar argument shows that every clause (crixi V diyi V … V cr n x n V cr n y n ) must be 
present. And no clause for f can contain both Xj and or both Xj and y[ 

416. The other clauses are 八 : 丄 ((Ui V ViV do) A (ui M ViM ao)) and (ao V ai V • • • V a n ). 
[See A. Biere and R. Brummayer, Proceedings^ International Conference on Formal 
Methods in Computer Aided Design 8 (IEEE, 2008)，4 pages [FMCAD08].] 

417. The four clauses (5 V t V ix) A (5 V t V v) A (5 V f V ?x) A (5 V t V v) ensure that 5 is 
true if and only if t? u: v is true. But we need only the first two of these, as in ( 174 ), 
when translating a branching program, because the other two are blocked in the initial 
step. Removing them makes the other two blocked on the second step, etc. 

418. A suitable branching program for h n when n = 3， beginning at in, is In = 

(1?21:22), I 2 i = (2? 31: 32), I 22 = (2? 32: 33), I 31 = (3? 0:42), I 32 = (3?42:43), h 3 = 
(3?43:1), Ia 2 = (1?0:1), /43 = (2?0:1). It leads via ( 174 ) to the following clauses for 
row i,l <i <m: x ik \/ r^/c+ij) A x ik \/ ri^ k+1J+1 ), for 1 < j < 

k <n; (fi, n+ i,i) A (ri ， n +i ， n +i) and V Xij) for 1 < j < n. Also the following 

clauses for column j, 1 < j < n: (ci,i,i); (cj^^V XkjV Cj^k+i,i) A (cj^^V Xkj\/ ), 

for 1 < i < A: < m; (cj, m+ i,i) A (Cj ， m +i ， m+ i) and V Xij) for 1 < i < m. 

419. (a) There are exactly n —2 solutions: Xij = [j = l][i^：m — l] + [j = 2][i = m — l] + 
[j = k][i = m—l]^ for 2 < k < n. 

(b) There are exactly m—2 solutions: Xij = [j > l][i = m —l]+[j = l][i = m —2] + 
[j = l][i = k]^ for 1 < k < m —2 or k = m. 

420. Start via ( 24 ) with (^1 V X 2 V s) A (xi V X 2 V s) A (xi V X 2 V s) A (xi V X 2 V 5 ); 
(xi V c) A (x 2 Vc) A (xi W x 2 Vc); (5 V x 3 V t) A (5 V V t) A (5 V x 3 V f) A (5 V x 3 V 

(s V c) A (X 3 V c) A (s V X 3 V c)\ (c) A {c). Propagate (c) and (5’)，obtaining (x\ V X 2 ) A 
(s V X 3 )； remove subsumed clauses (xi V X 2 V s)^ (5 V ^3 V f); remove blocked clause 
(5 V X 3 V t); remove clauses containing the pure literal t] rename 5 to ai. 

421. Start via ( 173 ) with (^ 5 V xi\/ a^) A (^ 5 V xi\/ as) A (« 4 V 无 ‘ 2 V a‘ 2 ) A ( 63 V X 2 V a‘ 2 ) 八 
(asW X 2 ) A («2 V xs) A (as). Propagate (as). 

422. (a) xi implies x‘ 2 , then ai, then ^ 3 ； X 2 implies xi, then ai, then ^ 3 - 

(b) xi implies a 3 , then x‘ 2 ，then a‘ 2 , then X 3 ； X 2 implies « 3 , then xi, a 4 , a‘ 2 , x^>. 

423. No; consider x\l (x 2 ^ xs ： X 4 ) - (^ 2 ? x\\ X 3 ) with L = (xs) A (^ 4 ). [But a forcing 
encoding can always be constructed, via the extra clauses defined in exercise 436. 
Notice that，in the presence of failed literal tests，weak forcing corresponds to forcing.] 

424. The clause 134 is redundant (in the presence of 123 and 234); it cannot be omitted ， 
because {23, 23, 12} l/i 3. The clause 234 is also redundant (in the presence of 134 and 
12); it can be omitted, because {13, 34, 1} l~i 4, {13, 34, 1} l~i 3, and {12,1,12} hi 2. 
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425• If x is in the core, F hi because Algorithm 7-1-1C does unit propagation. Oth¬ 
erwise F is satisfied when all core variables are true and all noncore variables are false. 


426. (a) True. Suppose the clauses involving a m are (a m V ai) for 1 < i < p and 
(a m V /3j) for I < j < q] then G contains the pq clauses (ai V /3j) instead. If F | L hi l 
we want to prove that G \ L hi 1. This is clear if unit propagation from F \ L doesn’t 
involve a m . Otherwise, if F \ L hi a m , unit propagation has falsified some ol{\ every 
subsequent propagation step from F \ L that uses (a m V /3j) can use (ai V /3j) in a 
propagation step from G\L. A similar argument applies when F\L \~i d m . 

(Incidentally, variable elimination also preserves “honesty .”） 

(b) False. Let F = (xiW V ai) A (xi V ^ 2 V ai), L = xi or ^ 2 - 

427. Suppose n = 3m，and let / be the symmetric function [vx<m or vx>2m}. The 
prime clauses of / are the iV = ( m : m ) 〜 3 n+3//2 /(27rn) ORs of m positive literals 

and m negative literals- There are N f = ( m _ 1 ^ m+1 ) = ways to specify that 

Xi 1 = • • • = Xi m =1 and Xi m+1 = • • • = Xi 2m _ 1 = 0 ; and this partial assignment 
implies that Xj = 1 for j 多 {ii ，•…， Therefore at least one of the m + 1 clauses 
(xi-^ V • • • V Xi m V Xi m+1 V • • • V Xi 2m _ 1 V Xj) must be present in any set of prime clauses 
that forces f. By symmetry, any such set must include at least N f /m prime clauses. 
On the other hand，/ is characterized by 0(n 2 ) forcing clauses (see answer 436). 


428. (a) {y\J ZjiW • • • V Zjd) for 1 < j < n; (xij\/ 為 fcV Zjk) for l<i<j<n^l<k<d. 

(b) Imagine a circuit with 2N(N + 1) gates gu, one for each literal l of G n d and 
for each 0 < t < meaning that literal l is known to be true after t rounds of unit 
propagation, if we start with given values of the Xij variables only. Thus we set gio <— 1 


if l = Xij and Xij is true, or if l 


Xij and Xij is false; otherwise gio <— 0. And 




unit propagation 
honesty 
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9i(t-\-i) 9it v \J {9ht 八…八沉以 I (/ V /i V • • • V //c) G G n d }， for 1 <t < N. 


Given values of the the literal y is implied if and only if the graph has no d - coloring; 
and at most N rounds make progress- Thus there’s a monotone chain for g y N = f n d- 
[This exercise was suggested by S. Buss and R. Williams in 2014， based on a 
similar construction by M. Gwynne and O. Kullmann.] 

429. Let S/c be the sum of the assigned x’s in leaves descended from node k. Unit 
propagation will force ^ 1 for 1 < j < Il/c, moving from leaves toward the root. 
Then it will force bj 4 - 0 for j = 1^ + 1， moving downwards from the root, because 
r = S ‘2 + S 3 and because ( 21 ) starts this process when A: = 2 or 3. 

430. Imagine boundary conditions as in answer 26, and assume that Xj 1 , •••, Xj r 
have been assigned 1, where ji < … < j r . Unit propagation forces s k j k+l __ k 1 for 
1 < A: < r; then it forces Sj k ^ k 0 for r > A: > 1 - So unassigned x’s are forced to zero. 

431. Equivalently + . + - Vyn < so we can use ( 18 )—( 19 ) or ( 20 )—( 21 ). 

432• The clauses of answer 404(b) can be shown to be forcing. But not those of 404(a) 
when a > 1 ; for example, if a = 2 and we assume x 2 , unit propagation doesn’t yield y 2 • 

433. Yes. Imagine, for example, the partial assignment x = 1***10**1, y = 10*00*1**- 
Then must be 1; otherwise we’d have 10010001 < x < y < 100001111. In this situ¬ 
ation unit propagation from the clauses that correspond to 1 < 〈 aiOl〉，ai < 〈 a‘ 2 无 2 O〉, 

a *2 < («3^3?/3), < (a 4 ^ 40 ), «4 < (asOO) forces ai = 1 ， a ‘2 = 1 ， a4 = 0, a3 = 0, 2/3 = 1. 

In general if a given partial assignment is consistent with x < we must have 
xl < 以个 ， where and 以 个 are obtained from x and y by changing all unassigned 
variables to 0 and 1 ， respectively- If that partial assignment forces some yj to a 


September 23, 2015 



7.2.2.2 


ANSWERS TO EXERCISES 


271 


n.f.: not falsified 
Bacchus 


particular value，the value must be 1; and we must in fact have x\. > y’ 个 ， where 
y f is like y but with yj = 0 instead of yj = *. If Xj ^ 1, unit propagation will force 
ai = ••• = aj 一 i = 1 ， a；c = ••• = aj = 0, %• = 1， for some k > j. 

Similar remarks apply when Xi is forced，because x < y y < x. 

4S4. (a) Clearly pk is equivalent to xi 八 • • • 八 x/c, g/c is equivalent to x/c A • • • A x ni and 
Tk implies that a run of exactly l Is begins at Xk. 

(b) When Z = 1， if x/c = 1 unit propagation will imply pj for j > k and qj for 
j < hence fj for j ^ k] then is forced, making Xj = 0 for all j ^ k. Conversely, 
Xj = 0 forces fj] if this holds for all j ^ then Tk is forced, making x/c = 1. 

But when l = 2 and n = 3, the clauses fail to force X 2 = 1 by unit propagation. 
They also fail to force xi = 0 when we have / = 2, n = 4, and X 3 = 1- 

435. The following construction with 0(nl) clauses is satisfactory when l is small: 
Begin with the clauses for pk and qk (but not rk) in exercise 434(a); include also 
(Xk ^ Pk-i) for / < A: < n, and (x k V q k +i) for 1 < k <n-l. Append (p k -i V q k ^i V x k ) 
for 1 < A: < n, omitting pj for j < 1 and omitting qj for j > n. Finally, append 

(xk V Xk-^i V Xk-^d) for 0 < A: < n and 1 < d < I ， (*) 

omitting Xj when j < 1 or j > n. 

To reduce to 0(n log Z) clauses, suppose 2 e+1 < l < 2 e + 2 ，where e > 0 - The 
clauses (*) can be replaced by (xk V y[ e ^ V z[ e ^) for 1 < A: < n, if Xk-d implies y[ e ^ 
for 1 < d < [l/ 2 \ and Xk-\-d implies for 1 < d < |"//2]. And to achieve the latter ， 
we introduce clauses V #+” ）， (y^t V (z^ V z[ t+1) ), (4+ 2 ^ v 4，+”）, 

(x k -i\/y ( k 0) ), v^ 0) ), v4 0) ), 0 ^ 一 2 石 + 1+「//21 v4° } ), for 1 < /c < n 

and 0 < t < e，always omitting Xj or yj or Zj when j < 1 or j > n. 

436• Let the variables g；, for 0 < A: < n and q ^ Q represent the sequence of states, and 

let tkaq represent a transition when 1 < A; < n and when T contains a triple of the form 

0/, a ,q). The clauses, are the following，for 1 < A: < n: (i) {tkaq^ x%) A (tkaq^ Qk)^ 
where x° k denotes Xk and x\ denotes Xk] (ii) (Qk-i^\/{tkaq , I (g ， a ， Y) G T}), for g G Q; 
(iii) (q k W \/{tkaq I (q\a,q) G T}); (iv) (x a k V \J{t k aq \ (g,a,g) G T}); (v) {t kaq > V 
\J{qk-i I G T}); together with (vi) (^ 0 ) for q e Q\I and (q n ) for q e Q\0. 

It is clear that if F hi no string xi ... x n G L can have Xk = a. Conversely, 
assume that F \/i x a k ^ and in particular that F l/i e. To prove the forcing property, we 
want to show that some string of L has Xk = a. It will be convenient to say that a 
literal l is ‘n.f/ (not falsified) if F i/i T; thus x% is assumed to be n.f. 

By (iv), there’s a (q\a^q) G T such that tkaq is n.f. Hence qk is n.f” by (i). If 
k = n we have g G O by (vi); otherwise some is n.f” by (ii), hence x b k+1 is n.f. 

Moreover, (v) tells us that there’s [q"6 T with q^i n.f. If k = 1 we have q G /; 
otherwise some t^-iy q n is n.f” by (iii), and x c k _ 1 is n.f. Continuing this line of reason¬ 
ing yields xi ... x n G L with Xk = a (and with Xk+i = b if k < n, Xk-i = c if k > 1). 

The same proof holds even if we add unit clauses to F that assign values to one 
or more of the x’s. Hence F is forcing. [See F. Bacchus, LNCS 4741 (2007), 133-147.] 
The language L 2 of exercise 434 yields 17n + 4 clauses: F = 八 1</c<n ((^/cooVx/c ) 八 
(^/cOoV 0/c) A (f/cii V Xk) A (t/cii V 1/c) A (t/d2 V Xk) A (f/ci2 V 2/,) A (^ 02 V XkJ A (^ 02 V 2/,) A 
(0/c-i V tkoo V tkii) A (l/c-i V tki 2 ) A (2/c_i V “ 02 ) 八 (a^/c V tkoo V “ 02 ) 八 (^/cV tkii V ^ 12 ) A 
(?A；oo V 0 a ； — 1 ) 八 (tkii V 0a ； _i) A (tki 2 ^ 1a ： -i) A (tk 02 V 2/j._i)) A (lo) A (2o) A (0 n ) A (l n ). 
(Unit propagation will immediately assign values to 10 of the 8n + 3 variables, thereby 
satisfying 22 of these clauses, when n > 3 - For example, fu‘ 2 , Lii ， On-i are forced.) 
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The clauses produced by this general-purpose construction can often be signifi¬ 
cantly simplified by preprocessing to eliminate auxiliary variables. (See exercise 426.) 

437. Each variable Xk now becomes a set of \A\ variables Xka for a G A, with clauses 
like ( 15 ) and ( 17 ) to ensure that exactly one value is assigned. The same construction is 
then valid，with the same proof，if we simply replace by c Xk a ’ throughout. (Notice 
that unit propagation will often derive partial information such as Xka^ meaning that 
Xk 7 ^ although the precise value of Xk may not be known.) 

438. Let l<j = h + ••• + lj. Exercise 436 does the job via the following automaton: 

Q — {0? I?---? l<t 1 — 1}, I = {0}, O = {/ <* + t — 1}; T = {(l<j + j, 0, l<j + j) I 
^ ^ j < t} U {(/ <j + j + A: ， 1 ， l<j + j + A: + 1) I 0 < j < ^ 0 < A: < U 

W<j + j — 1 ， 0 ， + j — [j = t]) \ 1 < j < t}. 

439. We obviously want the clauses (xj V %+i) for 1 < j < n; and we can use ， say ， 
( 18 ) and ( 19 ) with r = ^ to force Os whenever the number of Is reaches t. The difficult 
part is to force Is from partial patterns of Os; for example, if n = 9 and t = 4, we can 
conclude that ^4 = ^6 = 1 as soon as we know that X 3 = X 7 = 0 . 

An interesting modification of ( 18 ) and ( 19 ) turns out to work beautifully, namely 
with the clauses (tj V t, +1 ) for 1 < j < — 1 and l<h<n — together with 

(a> 2 j+/c-i V 發 j — for 1 < j < t and 0 < k < n — 2 尤 + 1 ， omitting t^j-i andt^l 2 f+ 2 . 

440. It’s convenient to introduce (HiV| variables Pik for all P G and for 1 < i < 
A: < n, as well as \N\ 2 variables QRijk for Q, R 6 N and for 1 < i < j < k < 
although almost all of them will be eliminated by unit propagation. The clauses are: 

⑴ {QRijk V Q^o-d) A (QR ljk V Rjk )； (ii) (Pkk^ \J{xt \ p^ae U}); (iii) (F ik V 
\J{QRijk \ i<j<k, P QRe W}), if i<k; (iv) (xt V \J{Pkk \ p ae [/}); 
(v) V \/{PR i(k+1)l \k<l<n,ReN}\/ \J{QR hik \l<h<i,Qe iV}), if i > 1 
or k <n; (vi) (QR ijk V \J{Pik \ P^QRe W}); (vii) (Pin) for PeN\S. 

The forcing property is proved by extending the argument in answer 436: Assume 
that is n.f.; then some Pkk with P ^ a is also n.f. Whenever is n.L with i > 1 

or A: < n，some PRi^iy or QRhik is n.f.; hence some “larger” or P f hk is also n.f. 

And if Pin is n.f., we have P ^ S. 

Furthermore we can go “downward ”： Whenever Pik is n.f. with i < k, there’s 

C? • 一 1 ) 

such that x a k is n.f. Our assumption that x a k is n.f. has therefore shown the existence 
oi xi ... x n ^ L with Xk = a. 

[See C.-G. Quimper and T. Walsh, LNCS 4741 (2007), 590-604]. 

441. See O. Bailleux, Y. Boufkhad, and O. Roussel, LNCS 5584 (2009), 181—194. 

442. (a) F I L~ = F\h \.. .\l q -i\I q contains e if and only if _F | Zi | ••• | l q -\ contains e 
or the unit clause (l q ). 

(b) If F l/i / and F | [ hi e, the failed literal elimination technique will reduce F 
to F| / and continue looking for further reductions. Thus we have F h 2 l if and only if 
unit propagation plus failed literal elimination will deduce either e or l. 

(c) Use induction on k] both statements are obvious when k = 0. Suppose we 
have F h/c+i l via h, … ， l p = T, with F \ L~ h/c e for 1 < g < p. If p > 1 we have 
F\l\L~ h/c e for 1 < ^ < p; it follows that F \ l h/c+i l p -i and F \ l h/c+i l p -i. If p = 1 
we have F\l\~k e. Hence F\ l hk+i e in both cases. 

Now we want to prove that F | l hk+i e and F hfc +2 e, given F hk+i I’ and 
F h/e+i T. If F I L~ h/, e for 1 < ^ < p, with l p = I ’ ， 观 know that F \ L~ \~k+i e- 
Furthermore we can assume that F l//c+i l; hence I ^ l q ^or 1 < q < and l ^ l p . If 


and Rjk are n.f.; on the other hand if Pkk is n.f., there’s a G A 


QRijk such that Qi 
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l = l q for some q < p, then F\l\L~ 卜 ^ e for 1 < r < g and F \ L~ h/c e for ^ < r < p; 
otherwise F | / | L~ h/c e for 1 < ^ < p. In both cases F \ l hk-\-i l f and F \~k -\-2 
Essentially the same proof shows that F\ l \~k+i ? and F h k+ 2 

(d) True, by the last relation in part (c)- 

(e) If all clauses of F have more than k literals, L/ C (F) is empty; hence Lo(R f )= 
Li(R f ) = L 2 {R r ) = 0. But Lk(R ’） = {i ， 2, 4} for k > 3; for example ， 丑 ’ h 3 i because 
R f 11 h 2 e, because R f 11 h 2 3 and R f 11 h 2 3- 

(f) Unit propagation can be done in O(N) steps if N is the total length of all 
clauses; this handles the case k = 1. 

For k > 2^ procedure Pk{F) calls Pk-i{F \ x\)^ Pk-i{F \ x\)^ Pk-i{F \ X 2 )^ etc” 
until either finding Pk-i(F \ l) = {e} or trying both literals for each variable of F. In 
the latter case, Pk returns F. In the former case, if Pk-i{F \ l) is also {e} 5 Pk returns 
{e}; otherwise it returns Pk(F \ l). The set contains all literals for which weVe 
reduced F to F\ unless Pk{F) = {e}. (In the latter case, every literal is in Lk.) 

To justify this procedure we must verify that the order of testing literals doesn’t 
matter. If F\l \~k e and F\T \~k e, we have F\l\T h k e and F\l r \1\-^ e by (c); hence 

Pk(F\l) = P k {F\l\l') = P k (F\r\l) = P k {F\l'). 

[See O. Kullmann ，Annals of Math, and Artificial Intell. 40 (2004) ， 303-352.] 
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443. (a) If F I L he then F \ L h l for all literals /; so if F G PC/c we have F \ L l 
and F \ L l and F\ L \- k proving that PC^ C UC/^ 

Suppose F G UC/c and F \ L h 1. Then F \ L\l h and we have F | L | [ h；, e. 
Consequently F\ L h/, 4.1 proving that UC/c ^ PC/c+i. 

The satisfiable clause sets 0 , {1}, {1,12}, {12,12}, {12,12,12,123}, {123,123, 
123,123}, {123,123,123,123,123,123,123,1234}, …， show that PCa, ^VC k ^ PC/,+ 1 . 

(b) F G PCo if and only if F = 0 or 6 G F. (This can be proved by induction on 
the number of variables in because e 丰 F implies that F has no unit clauses.) 

(c) If F has only one clause, it is in UCo. More interesting examples are {12,12}; 
{1234,1234}; {1234,1234,1234,1234}; {12,12,345,345}; etc. In general, F is in UC 0 
if and only if it contains all of its prime clauses. 

(d) True, by induction on n: If F| L hi then F\L\l he, and F\L\l has < n — 1 
variables; so F| L| [ G PC n -i C UC n —i. Hence we have F\L\l h n _i e and F\ L h n 1. 

(e) False，by the examples in (c)- 

(f) R r G UC 2 \ PC‘ 2 . For example, we have R r \ 1 (- 2 2 and R' \ 1 h 2 2. 

[See M. Gwynne and O. Kullmann ， arXiv: 1406.7398 [cs.CC] (2014)，67 pages-] 

444. (a) Complementing a variable doesn’t affect the algorithm’s behavior, so we can 
assume that F consists of unrenamed Horn clauses- Then all clauses of F will be Horn 
clauses of length > 2 whenever step E2 is reached. Such clauses are always satisfiable, 
by setting all remaining variables false; so step E3 cannot find both F hi l and F hi l. 

(b) For example, {12, ） 3,123,123}- 

(c) Every unsatisfiable F recognized by SLUR must be in UCi- Conversely，if 
F G UCi, we can prove that F is satisfiable and in UCi whenever step E2 is reached. 

[Essentially the same argument proves that a generalized algorithm, which uses 
\~k instead of hi in steps El and E3, always classifies F if and only if F G UC/c- See 
ML Gwynne and O. Kullmann, Journal of Automated Reasoning 52 (2014) ， 31—65 」 

(d) If step E3 interleaves unit propagation on F | / with unit propagation on 
F I l^ stopping when either branch is complete and e was not detected in the other ， 
the running time is proportional to the number of cells used to store using data 
structures like those of Algorithm L. (This is an unpublished idea of Klaus Truemper.) 
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[SLUR is due to J. S. Schlipf, F. S- Annexstein, J. V. Franco，and R. P- Swami- 
nathan ，Information Processing Letters 54 (1995), 133-137.] 

445. (a) Since the lexicographic constraints (i6g) are forcing, a succinct certificate is 

， X2m 5 • • • ，— l)m ， $ 2 (m—l) ， $3(m—l )， • • • ， ^(m— l)(m— 1 ) 5 $3(m—2) ， $4(m — 2)，• • • ， 

X( m _i)( m _ 2 ) 5 •… ， 元 (m_i) 2 ， 0) - The first m — 1 steps can be replaced by c xom^ 

(b) 1 ) 15 $(m - 2)2， ••” $l(m-1) ， 0). 

(C) (TOI ， $12， • • • ， —2)(m-l) ， 0). 

446. Z(m^ n) — 1, because a 4-cycle corresponds to a quad- 

447. For general m and n we can add the m-n-/3! constraints (xij V 

Xitj V Xi>j> V V V Xijff) to ( 184 )，for 1 < i < i r < i n < m 

and distinct {j ， / ， j"} G {1 ， •… 5 n}. The 19-edge graph illustrated here works when 
m = n = 8] and Algorithm C finds girth > 8 unsatisfiable with 20 edges, after only 
400 megamems of calculation (using lexicographic row / column symmetry). 

448. Each pair of points can occur together in at most one line. If the lines contain 

respectively h ， … ， points，we therefore have (^ 1 ) H - h (^) ^ (?) = 3n. A Steiner 

triple system achieves equality，with h = ••• = l n = 3. Since ~ x ) + ( l J 1 ) < (^) + (%) 
when / > + 2, we can’t have h + • • • + > 3n. Thus Z(m ， n) = 3n + 1. 

[If m is even and (?) = 3n，we can’t cover all the pairs with triples, because no 
point can be in more than (m — 2)/2 triples. Daniel Horsley has shown that Z{m^n )= 
3n + [1 — m/14」in such cases; see (a paper in preparation).] 
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449. It’s wise to try first for symmetric solutions with Xij = Xji, roughly halving the 
number of variables; then the matrices below are found quickly. Such solutions are 
impossible when n = 9 ， 12， 13 (and also when n = 15 and 16 if we insist on five 
Is in the top row). The case n = 13 corresponds to the projective plane of order 3; 


indeed, a projective plane of order q is equivalent to a maximum quad-free matrix with 



Q 


2 


+ g + 1 and Z(n^ n) 


=(g + l)n + 1 . 


11100000 

10011000 

10000110 

01010100 

01000011 

00110001 

0010101() 

00001101 


111100000 

100011100 

100000011 

010010010 

010001001 

001010001 

001000110 

000101010 

000100101 


111100000() 

1000110000 

1000001100 

1000000011 

0100101010 

0100000101 

0010100001 

0010010110 

0001100100 

0001011001 


11110000000 

10001100000 

10000011100 

10000000011 

01001010010 

01000001001 

00101001000 

00100110001 

00100000110 

00011000101 

00010101010 


111100000000 

100011100000 

100000011000 

100000000111 

010010010100 

010001001010 

010000100001 

001010001001 

001000110010 

000110000010 

000101010001 

000100101100 


1111000000000 

1000111000000 

1000000111000 

1000000000111 

0100100100100 

0100010010010 

0100001001001 

0010100010001 

0010010001100 

0010001100010 

0001100001010 

0001010100001 

0001001010100 


11110000000000 

10001110000000 

10000001110000 

10000000001110 

01000100001001 

01001001000100 

01000010100010 

00100101000010 

00100010011000 

00100000100101 

00011000101000 

00010100010100 

00010011000001 

00001000010011 


111100000000000 

100011100000000 

100000011100000 

100000000011110 

010010010010000 

010001001001000 

010000000100101 

001010001000011 

001001010000100 

001000100110000 

000110000101000 

000101000010001 

000100101000100 

000100010000010 

000000110001001 


1111000000000000 

1000111000000000 

1000000111100000 

1000000000011100 

0100100100010000 

0100010010001010 

0100000001000101 

0010100010000100 

0010010100000001 

0010001001001000 

0010000000110010 

0001100000101001 

0001010001010000 

0001001100000110 

0000010000100100 

0000001010010001 


450• To prove the hint，add the unary clause (^ 15 ) to the others; this problem is rapidly 
found to be unsatisfiable, hence no line has more than 4 points. On the other hand, a 
line with fewer than 3 points is impossible because Z(9,10) = 32. The same arguments 
show that every point belongs to either 3 or 4 lines. Thus exactly four lines contain 
four points, and exactly four points lie on such lines. 

If p G / and l is a 4 - point line, every other line containing p must contain 2 of 
the remaining 6 points. And the four 4 - point lines contain at least 4x4—($) =10 
points altogether. Hence, pigeonwise，we see that each of the four 4 - point lines contains 
exactly one of the four 4-line points. 

Now we may call the 4-line points {a ， 6 , c, d}，and the 4-point lines {A^ D}. 

The other points may be called {ab, ac, ad, bc’bd, cd\, with A = {a^ ab^ac^ad}^ B = 
{b ， ab,bc,bd}, C = {c ， ac ， 6 c ， cd}, D = {d,ad ， bd,cd}. The other lines can be called 
{AB^ AC^ AD^ BC, BD, CD}; and we have AB = {a, 6 , cd }，AC = {a, c, ad}^ etc- 
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451. One of the colors can be placed uniquely, by the previous exercise- So we’re left 
with the simple problem of two-coloring the remaining 66 squares and avoiding both 
0-quads and 1-quads. That problem is unsatisfiable with Xij odd. The author then 
constructed a 33 + 33 + 33 solution by hand，using the fact that each color class must 
be unable to use the deleted square. [See M. Beresin, E. Levine, and 
J. Winn ，The College Mathematics Journal 20 (1989) ， 106-114 and the 
cover; J. L. Lewis ， J. Recreational Math. 28 (1997), 266-273.] 


452• Any such solution must have exactly 81 cells of each color, because 
R. Nowakowski proved in 1978 that Z(18,18) = 82. The solution exhibited 
here was found by B. Steinbach and C- Posthoff [Multiple-Valued Logic and 
Soft Computing 21 (2013) ， 609-625]，exploiting 90° rotational symmetry* 

453. (a) If RC {1,..., m) and C C {1,..., n}, let V (i?, C) = {ui \ i G \ j EC}. If 

X is decomposable, there’s no path from a vertex in V {R, C) to a vertex not in V {R, (7); 
hence the graph isn’t connected. Conversely，if the graph isn’t connected, let V (R, C) be 
one of its connected components- Then 0 < |i?| + |(7| < m+n，and weVe decomposed X. 

(b) False in general, unless every row and column of X r contains a positive 
element. Otherwise, clearly true by the definition of lexicographic order. 

(c) True: A direct sum is certainly decomposable- Conversely, let X be decom¬ 
posable via R and C. We may assume that 1 G i? or 1 G (7; otherwise we could replace 
R by {1, ••” m) \ R and (7 by {1 ， •…， n} \ (7. Let i > 1 and j > 1 be minimal such 
that i 丰 R and j ^ C. Then x^j = 0 for 1 < i 7 < i and = 0 for 1 < j < j. 
The lexicographic constraints now force = 0 for 1 < i 7 < i, j > j; also for i > i, 
1 < J < j. Consequently = X’ ㊉ X", where X' is (i — 1) x (j — 1) and X" is 
(m + 1 — i) x (n + 1 — j). (Degenerate cases where i = 1 or j = 1 or i = m + 1 or 
j = n +1 need to be considered, but they work fine. This result allows us to “read off” 
the block decomposition of a lexicographically ordered matrix.) 

Reference: A. Mader and O- Mutzbauer, Ars Combinatoria 61 (2001), 81-95- 

454. We have f{x) < f{xr) < f(xrr) < ••- < f{xr k ) < • • •; eventually xr k = x. 

455. (a) Yes, because C only causes 1001 and 1101 to be nonsolutions, (b) No, because 
F might have been satisfied only by 0011. (c) Yes as in (a), although ( 187 ) might no 
longer be an endomorphism of FAC as it was in that case- (d) Yes; if 0110 is a solution, 
so are 0101 and 1010. [Of course this exercise is highly artificial: We’re unlikely to know 
that a weird mapping such as ( 187 ) is an endomorphism of F unless we know a lot 
more about the set of solutions.] 

456. Only (1 + 2.7)(1 + 2)(1 + 8 ) = 405, out of 65536 possibilities (about 0.06%). 

457. We have mm 0 <k<i6(k k ^6 16 ~ k ) = 6 6 16 10 « 51.3 x 10 16 . For general n, the mini¬ 
mum occurs when k = 2 n /e+ 0 (l); and it is 2 2 ^ ^ n ~ x ^> where x = l/(e In 2 )+ 0 ( 2 _n ) < 1 . 

458• The operation of assigning values to each variable of an autarky, so that all clauses 
containing those variables are satisfied, while leaving all other variables unchanged, is 
an endomorphism. (For example, consider the operation that makes a pure literal true.) 

459. sweep(X^j) = 0 when i = 0 or j = 0. And for 1 < i < m and 1 < j < n we have 
sweep (X^) = max (而 j + sweep(X (i _i) (j _i )) 5 sweep^^i^-), sweep(X i(j _i))). 

[Let the Is in the matrix be Xi 1 j 1 ， •… ， Xi r j r , with 1 < i\ < — < i r < m and with 
j q+ i < j q when = i q . Richard Stanley has observed (unpublished) that sweep(X) 
is the number of rows that occur when the Robinson-Schensted-Knuth algorithm is 
used to insert the sequence n — ji, •…, n — j r into an initially empty tableau.] 
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460. We introduce auxiliary variables s\j that will become true if sweep(Xij) > t. 
They are implicitly true when t < 0, false when t = k. The clauses are as follows, for 
1 < i < m, 1 < j < n, and 0 < t < min(i — 1， j — 1 , k): ( 略 - V s\j), if i > 1 and 

t < v 4j), if j > 1 and t < k; and (xij V 《二 1 收一” V s\j). Omit 5^^ and 

忑 ^l\)o an d and s^j from that last clause, if present. 

461. VSI 1 \! n jZl V qi-i^Vc^) A \J ?=1 VJ=i(c(i-i)jV% V x i(j+1) ), omitting c 0j . 
These clauses take care of n; interchange i ^ m n for T 2 - 

462. Let Xij denote the last m + 1 —J, rows and the last n + 1 — j columns of X; and 
let tij = sweep(X“ 一 i)(j 一 i))+sweep(X(i+i)(j+i)). For n we must prove l + h(j+i) < 众， 
given that 1 + tij < k. It’s true because sweep(X( i _ 1 ) J -) = sweep(X( i _ 1 )( J -_ 1 )) when 

column j begins with i — 1 zeros，and we have sweep(X( i+1 )(j +2 )) < sweep(JC(i +1 )(j +1 )). 

Let X’ — XT 3 have the associated sweep sums t\j. We must prove that t\j < k and 
l + ^ +1)(j+1) < k, ifl + tij < k, l + ^ ( j + i) < k, l + t {i+1)j < k, andt (i+ i )(i+ i) < k. The 
key point is that sweep(X^-) = max(sweepsweep- 1 ))), since x\j = 0 . 
Also sweep(X ( , . +1)(j+1) ) = 1 + sweep(X (i+2)(j+1) ). 

(Notice that t\ and T 2 might actually decrease the sweep, but T 3 preserves it.) 


463• If row i + 1 is entirely zero but row i isn’t , 乃 will apply. Therefore the all-zero 
rows occur at the top. And by n, the first nonzero row has all its Is at the right. 

Suppose rows 1 through i have n， …， n Is, all at the right, with n > 0. 
Then r 1 < ••- < by T 2 - If i < n we can increase i to i + 1， since we can’t have 
> X(i_|_i)(j + i) when j < n — r 《， by ti; and we can’t have it when j > n — ri, hy T3. 
Thus all the Is are clustered at the right and the bottom, like the diagram of a 
partition but rotated 180°; and the sweep is the size of its “Durfee square” (see Fig. 48 
in Section 7-2.1.4). Hence the maximum number of ls，given sweep is k(m + n — A:). 

464. By answer 462, n can be strengthened to 丁 ’ u which sets 卜 1 but leaves 

Xij = 1. Similarly , 乃 can be strengthened to These endomorphisms preserve the 
sweep but increase the weight, so they can’t apply to a matrix of maximum weight. [One 
can prove, in fact，that max-weight binary matrices of sweep k are precisely equivalent 
to k disjoint shortest paths from the leftmost cells in row m to the rightmost cells in 
row 1. Hence every integer matrix of sweep k is the sum of k matrices of sweep 1-] 

465. If not ， there’s a cycle xo —>• xi • • • —> = xo of length p > 1, where XiT UVi 4 
Xi+i. Let uv be the largest of {uv\^ • • •, uv p -i}. Then none of the other r^s in the cycle 
can change the status of edge uv. But that edge must change status at least twice. 


466. Notice first that vn must be true, if m > 2. Otherwise /in, 仍 1 ， /i‘ 22 , 仍 2 , ... 
would successively be forced by unit propagation, until reaching a contradiction at the 
edge of the board. And V31 must also be true, if m > 4, by a similar argument- Thus the 
entire first column must be filled with verticals, except the bottom row when m is odd. 

Then we can show that the remainder of row 1 is filled with horizontals, except 
for the rightmost column when n is even. And so on. 

The unique solution when m and n are both even uses Vij if and only if i + J is 
even and max(i, m — i) < j < n/2，or i + j is odd and i^( n + 1 _j) is used. When m is 
odd, add a row of horizontals below the (m — 1) x n solution• When n is odd，remove 
the rightmost column of verticals in the m x (n + 1) solution. 

467. The 8x7 covering is obtained by reflection of the 7x8 covering 
(shown here) about its southwest-to-northeast diagonal- Both solutions 
are unique. 
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468. (a) Typical running times with Algorithm C for sizes 6 x 6 , 8 x 8 , .. • , 16 x 16 
are somewhat improved: 39 K#，368 K# ， 4.3 M#, 48 M#, 626 Mp, 8 Gfi. 

(b) Now they’re even better, but still growing exponentially: 30K/x，289 K/x ， 
2.3 M", 22 M^, 276 M/x, 1.7 Gfi. 

469 - For instance (饥 1 ), ( 仍 1 ),(仍 1 ) ， (" 12 ), (hi4), ( 仍 2 ), ( 说 2 ) ， ("23) ， ( 仍 3 ) ， e. 

470• There can’t be a cycle xo —> x\ = xo of length p > 1， because the 

largest vertex whose mate is changed always gets smaller and smaller mates. 

471. We must pair 2n with 1, then 2n — 1 with 2, •…, then n + 1 with n. 

472. We can number the vertices from 1 to mn in such a way that every 
4-cycle switches as desired. For example, we can make (i,j、< (i，j + 1) 

㈡ (ij)< (i + l ， j) ㈡ (i,j) mod4G {( 0 , 0 ), ( 0 , 1 ), ( 1 , 1 ), ( 1 , 2 ), ( 2 , 2 ), 

(2, 3) ，（ 3, 3) ，（ 3, 0)}. One such numbering in the 4x4 case is shown here- 

473. For every even-length cycle vo —— v\ - - - V 2 r-i —— vo with vo = maxvi and 

vi > v‘ 2 r—i, assert (voWV V 1 V 2 V U 2 V 3 V • • • V v^r-ivo). 

474. (a) (2n) . (2n - 2) . ... . 2 = 2 n n\. (b) (173)(173)(2525)(44)(6)(6). 

(c) Using 0, 1 , …， f for the 4-tuples 0000 ， 0001 ， … ， 1111， we must have 
/(o) = /(9) = / ⑸ ,/ ⑵ =/(b) = /(7); /(4) = /(8) = /(d); and /(6) = /(a) = /(f); 
in other words, the truth table of / must have the form abcdeagceagcfehg, where 
a,b,c,d,e, / ，分 ， ^ G {0,1}. So there are 2 8 

(d) Change c =^ to in (c). There are no such truth tables，because (igi) 
contains odd cycles; all cycles of an antisymmetry must have even length. 

(e) The 128 binary 7-tuples are partitioned into sixteen “orbits” {x^ xa, xa 2 ^ • • • }， 
with eight of size 12 and eight of size 4. For example, one of the 4s is {0011010, 0010110, 
0111110, 0110010}; one of the 12s is {0000000, 0011101,..., 1111000}. Hence there are 

2 16 functions with this symmetry, and 2 16 others with this antisymmetry* 

475. (a) 2 u+l n\. (There are 2 n+ 1 n!/a，if / has a automorphisms+antiautomorphisms.) 

(b) (xz)(xz)^ because (surprisingly) (z V y) 八 （x ㊉ 之） = (乏 V ?/) 八（乏 ㊉ 无 ） • 

(c) In general if cr is any permutation having a cycle of length Z, and if p is a 
prime divisor of Z，some power of a will have a cycle of length p. (Repeatedly raise a 
to the qth. power for all primes q ♦ p, until all cycle lengths are powers of p. Then, if 
the longest remaining cycle has length p e , compute the p e__1 st power.) 

(d) Suppose f(xi^X 2 ^xs) has the symmetry [xiX 2 X^){xiX 2 Xz)• Then /(0,0,0)= 
/(I ， 1 ， 0 ) = /( 0 , 1 ， 1 ) ， /(l, 1 , 1 ) = /( 0 , 0 , 1 ) = /( 1 , 0 , 0 ), so (xiX 2 )(xiX 2 ) is a symmetry. 

(e) A similar argument shows that (ux)(vw)(ux)(vw) is a symmetry. 

(f) If a is an antisymmetry of /, then a 2 is a symmetry. If / has a nontrivial 
symmetry, it has a symmetry of prime order p, by (c). And if p _ 2， it has one of 
order 2, by (d) and (e), unless n > 5- 

(g) Let f(xi ， … ， : r 6 ) = 1 only when x\.. .xq G {001000, 001001 ， 001011 ， 010000, 
010010, 010110,100000,100100,100101}. (Another interesting example, for n = 7, has 
f = 1 ■<=> xi... X 7 is a cyclic shift of 0000001 , 0001101 , or 0011101 ; 21 symmetries.) 

476. We want clauses that specify r - step chains in n variables, having a single out¬ 
put Xn^r. For 0 < t < t f < 2 n ，introduce new variables A tt ，= x^ n+r ) t ㊉ a:( n + r )r. 
(See ( 24 ).) Then for each signed involution a，not the identity，we want a clause that 
says is not a symmetry of /，’’ namely (\/{A tt f \ t < t r and i! — ter}). (Here t is 
considered to be the same as its binary representation (ti ... t n ) 2 ^ as in exercise 477-) 

Also, if a has no fixed points — this is true if and only if a takes Xi Xi for at 
least one i — we have further things to do: In case (b)，we want a clause that says V 
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is not an antisymmetry，” namely (\/{A tt f \ t < t f and t r = ta}). But in case (a)，we 
need further variables aj for 1 < j < T, where T is the number of signed involutions 
that are fixedpoint-free. We append the clause (ai V • • • V a ： r)，and also (aj V A tt f) for 
all t < t’ such that i! = ta when a corresponds to index j. Those clauses say, “there’s 
at least one signed involution that is an antisymmetry.” 

There are no solutions when n < 3. Answers for (a) are (((xi ®x‘ 2 ) Va ： 3 ) 八為） 0 xi 
and (((( 无 i ㊉ X 2 ) A ^ 3 ) ® xa) A Xh) ® xi\ in both cases the signed involution (11)(22) 
is obviously an antisymmetry. Answers for (b) are ((an ® X2) V X3) A {xa V xi) and 
(((xi A a> 2 ) ® X 3 ) A X 4 ) ® ( 尤 5 V x\). [Is there a simple formula that works for all n?] 

477. Use the following variables for 1 < /i < n < i < n + r, and 0 < t < 2 n : xu = 
(tth bit of truth table for Xi); ghi = [gh = Xi]\ Sijk = [xi =Xj Xk]^ for 1 < j < A: < i; 
fipq = for 0 < p^q < 1, p + ^ > 0 - (We don’t need fioo, because every operation 

in a normal chain takes (0, 0) 4 0.) The main clauses for truth table computations are 


(s^k V (xu ㊉ a)V (a:# ㊉ 6 ) V (以亡 ㊉ c) V (/— ®a)), for 0 < a, 6 ,c < 1 and 1 < j < k < i. 


Simplifications arise in special cases: For example, if 6 = c = 0, the clause is omitted 
if a = 0, and the term fioo is omitted if a = 1 - Furthermore if t = (ti ... t n ) 2 , and if 
j < n, the (nonexistent) variable Xjt actually has the known value again we omit 
either the whole clause or the term (Xjt ㊉ 6 )， depending on b and t. For example, there 
usually are eight main clauses that involve s^k] but there’s only one that involves sn ‘2 
when t < 2 n_2 5 namely (^12 V xn)^ because the truth tables for x\ and a ;2 begin with 
2 n "~ 2 0s. (All such simplifications would be done by a preprocessor if we had defined 
additional variables fioo and Xjt, and fixed their values with unit clauses.) 

There also are more mundane clauses, namely (ghi V xu) or (ghi V xu) according 
as gh(h, ••” t n ) = 0 or 1 ， to fix the outputs; also (V^n+i 9hi) and (vd 1 如 ), 
to ensure that each output appears in the chain and that each step has two operands. 
Additional clauses are optional, but they greatly shrink the space of possibilities: 

(vr=i 9ki v V?ti +1 V}=i s i'ji v Vj^'+i s i'ij) ensures that step i is used at least 

once; and (s^/c V 士 ’h) for i < % < n + r avoid reapplying an operand. 

Finally, we can rule out trivial binary operations with the clauses (/ioi V/noV/m )， 


(Aoi V/no V/m)，(/ioi V/ao V/ai)- (But beware: These clauses, for n < i < n + r, will 
make it impossible to compute the trivial function = 0 in fewer than three steps!) 
Further clauses such as (s^k V fioi V xu V Xjt) are true，but unhelpful in practice. 

478. We can insist that the (j, k) pairs in steps n+1， ••” n+r appear in colexicographic 
order; for example, a chain step like a ： 8 =X 4 ® X5 need never follow X7 = X2 A xq. The 


clauses，for n < i < n + r, are ( 哀 ij/cV if 1 < j 1 < j < k = k f < i or if 1 < j < k 
and 1 < j < k r < k < i. (If (j, k) = k r )^ we could insist further that fioifnofm is 
lexicographically less than /( i+ i) 0 i/(i+i)io/(i+i)n- But the author didn’t go that far.) 

Furthermore, if p<q and if each output function is unchanged when x v is swapped 
with x q , we can insist that x p is used before x q as an operand. Those clauses are 

(sijq V V n <^<i Vi< 7 /</c^<^ [/ =P or k ’ =P\ s i ， j ， k，) whenever j ^ p. 


For example，when answer 477 is applied to the full-adder problem, it yields M r 
clauses in N r variables，where (M 4 , M 5 ) = (942,1662) and N 5 ) = (82,115). The 
symmetry-breaking strategy above，with (p ， g) = (1 ， 2) and (2,3), raises the number of 
clauses to M’” where = (1025, I860). Algorithm C reported c unsaf after 

(1015,291) kilomems using (M 4 , M 4 ) clauses; c saf after (250,268) kilomems using 
(M 5 ， M‘）. With larger problems，such symmetry breakers give significant speedup 
when proving unsatisfiability，but they’re often a handicap in satisfiable instances- 
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479. (a) Using the notation of the previous answer, we have (Mg, Mg^Ng) = (14439, 
17273, 384) and (M 9 ,Mq,N 9 ) = (19719,24233,471). The running times for the ‘sat’ 
cases with Mg and Mg clauses were respectively (16, 645, 1259) and (66, 341 ， 1789) 
megamems — these stats are the (min, median ， max) of nine runs with different random 
seeds. The ‘unsat’ cases with Mg and M’ 8 were dramatically different: (655631 ， 861577, 
952218) and (8858,10908,13171). Thus 5 ( 4 ) = 9 in 7.1.2—( 28 ) is optimum. 

(b) But 5 ( 5 ) = 12 is not optimum，despite the beauty of 7.1.2—( 29 )! The Mu = 


76321 clauses in Nn = 957 variables are c saf in 680 Gfi, yielding an amazing chain: 


X 6 = Xi ® X 2 ^ X10 = XqW X 7 ^ 

X 7 = Xl ® X 3 ^ Xll = X 4 ㊉ XQ^ 

Xg = X4® X 5 ^ X12 = X9 ㊉ Xio^ 

X9 = X3® Zq = X13 = X 5 © Xn, 


X 14 = X 8 A Xll, 
之 1 = X 15 = XlO ® Xl 4 ^ 

Z 2 = Xi 6 = X12 A X15. 


And (M{ 0 ， iVio) = (68859,815) turns out to be ‘unsat’ in 1773 gigamems; this can be 
reduced to 309 gigamems by appending the unit clause (分 3(15 ))，since ( 7 (^ 4 , 5 ) = 10- 
Hence we can evaluate x\ H - h ^7 in only 5 + 11 + 2 + 1 = 19 steps, by computing 

(^ 1 ^ 0)2 = T 5 +X 6 +X 7 , (^2^12 ： o)2 = ri+X2+$3+a ： 4+ 从 0, {W2Z\)2 = 从 1+ 仍，么 2 = 仍 ㊉ 

(c) The solver finds an elegant 8 -step solution for (Mg, Ng) = (6068, 276) in 6 M#: 

X4 = Xi W X2^ X6 = X 3 ® 尤 4 ， X8 = 尤 3㊉ ^ 5 , Si = Xio = X6 A 

X b = Xi® $2 ， So = X7 = X3^/ S3 = X9 = X6 A Xg^ S2 = Xu = $7 ㊉ Xg - 


The corresponding (M^ N 7 ) = (5016,217) problem is ‘unsat’ in 97 Mfi. 

(d) The total cost of evaluating the S^s independently is 3 + 7 + 6 + 7 + 3 = 26, 
using the optimum computations of Fig. 9 in Section 7.1.2. Therefore the author was 
surprised to discover a 9-step chain for Si, S2, and using the footprint heuristic: 

= Xi ® X‘2, Xs = ^ 7 ， S3 = Xu = ^8 A T9, 

X 6 = Xi ® X3, Xq = Xq\J X7, S2 = X12 = Xs /\ Xio^ 

X7 = X3 ® 尤 4 ， Xio = X‘ 2 ㊉ $9 ， Si = X13 = Xg /\ Xio- 


This chain can solve problem (d) in 13 steps; but SAT technology does it in 12(!): 


X 5 = Xi ® X2^ Xq = Xq^J X7^ 

X 6 = Xi ® X3^ Xio = X 2 ® Xq^ 

X7 = X3® ^4 ， Xu = $5 V ^9, 

® 尤7， = X12 = Xg A Xio^ 


Si = X13 = Xs 八尤 10 , 
5^4 = XiA = Xi f\ Xll, 

^0 = 尤 15 = 為 V $11， 
义 2 = X\Q = Xg A Xu. 


The nonexistence of an 11-step solution can be proved via Algorithm C by a long 
computation (11034 gigamems)，during which 99,999,379 clauses are learned(l). 

(e) This solution (found in 342 G/x) matches the lower bound in exercise 7.1.2—80: 


X 7 = X\® ^ 2 , 

= ^3 0 尤 4, 

Xq = xi® 

Xio = x 6 ® Xg^ 


Xll = X 4 ® Xio^ 

X12 = 0 Xio^ 

X13 = 尤8 V 尤 11 ， 
Xi4 = X 7 ® Xi2^ 


X15 = X9 A X12, 
Xl 6 = X13 ® $ 15 , 
X17 = X14 A Xie- 


(f) This solution (found in 7471 G/i) also matches that lower bound: 

x 7 = Xi A X 2 ^ Xll = $5 © a；15 = 尤 8㊉ 尤 13 ， 

Xs = Xi ® X2^ XV2 = 尤 4 ㊉ 尤 11 ， XiQ = 尤 10 ㊉ 尤 14 ， 

$9 = $3㊉ $4 ， ^13 = ^9 ® Xll, X\7 = X7 ® X\Q^ 

Xio = x 5 A Xia = Xq\J Xi‘2, Xis = X15 V X17. 


Here xis is the normal function So ,4 = ^ 1 , 2 , 3 , 5 ,6 - We beat exercise 7-1.2-28 by one step. 

(g) A solution in t(3) = 12 steps is found almost instantaneously (120 megamems); 
but 11 steps are too few (‘unsat’ in 301 gigamems). 
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480. (a) Let xiX 2 X^xa = xix r yiy r . The truth tables for zi and are 0011010010001000 
and 01**1*00*011*011, where the *s (“don’t-cares”) are handled by simply omitting 
the corresponding clauses (ghi V 士 xh) in answer 477- 

Less than 1 gigamem of computation proves that a six-step circuit is ‘unsat' 
Here’s a seven - stepper, found in just 30 M#: $5 = ㊉ 尤 3, ^6 = ^3 V 尤 4, xs = xi ^ xq^ 

X7 = xi W X 5 ^ X9 = 尤 6㊉ X 7 ^ z\ = xio = X7 A xs^ z r = xn = 尤 3㊉ xg. (See exercise 
7.1-2-60 for a six-step solution that is based on a different encoding.) 

(b) Now we have the truth tables zi = 00110100010010000100100010000011, 
= 01**1*001*00*0111*00*011*01101**，if X 4 X 5 = yiy r . One of many 9-step 

solutions is found in 6.9 gigamems: xq = xi®X2, X7 = x‘2 ㊉ X5, $8 = ^ 40 ^ 6 , xg = X4 八 X7, 

Xio = Xl ㊉ X% Xll = X8 X12 = X3®X10, 2V = Xl3 = 尤 3 ㊉ 尤 11， Q = Xl4 = Xll Axi2. 

The corresponding clauses for only 8 steps are proved c unsaf after 190 Gfi of 
work. (Incidentally, the encoding of exercise 7.1.2-60 does not have a 9-step solution.) 

(c) Let c n be the minimum cost of computing the representation z\z r of (xi + 
• • • +x n ) mod 3 - Then (ci, 02 , 03 , 04 ) = (0, 2, 5, 7)，and c n _3 < c n + 9- Hence c n < 3n —4 
for all n > 2. [This result is due to A. Kojevnikov, A. S. Kulikov, and G. Yaroslavtsev, 
whose paper in LNCS 5584 (2009), 32-44, also inspired exercises 477-480.] 

Conjecture: For n > 3 and 0 < a < 2, the minimum cost of evaluating the (single) 
function [(a：i + • • • + x n ) mod 3 = a] is 3n — 5 — [(n + a) mod 3 = 0]- (It’s true for n < 5. 
Here’s a 12-step computation when n = 6 and a = 0, found in 2014 by Armin Biere: 

X7 = Xl® X2^ Xs = X3® X4^ Xq = Xl ® ^5 , 尤 10 = 尤 3 ㊉ ^ 5 , Xu = X 2 ® X\2 =^8 0 ^ 9 , 

£_13 = ^8 V T10, XiA = X7 ㊉ 尤 13 , 尤 15 = 无 12 八尤 13 ,尤 16 = Xu A X14, X17 = Xll © Xi 5 , 

夕 0 , 3,6 = x\s = x\q V X 17 - The case n = 6 and a _ 0， which lies tantalizingly close to 
the limits of today’s solvers, is still unknown. What is ... ^xq))1) 

481. (a) Since 2 ： ㊉ 2 / = {xix^x^) and z = x\ ㊉ x‘ 2 ㊉ $ 3 , this circuit is called a “modified 
full adder?’ It costs one less than a normal full adder, since z 1 = (xi ® X 2 ) ® X 3 and 
2 ： = (xi ®$ 2 ) V (xi ® (And it’s the special case 从 = 0 of the more general situation 
in exercise 7.1.2—28.) Part (b) describes a “modified double full adder •” 

(b) The function 2:2 has 20 don’t - cares，so there are many eight-step solutions 
(although 7 are impossible); for example, xq = xi ㊉ 尤 5 , X 7 = X 2 Z 3 = xs = ^30^6, 
Xq = X4® Xio = XiW X 7 ^ Xll = X3 /\ Z2 = X12 = X 6 ㊉ Xn^ ZI = X 13 = Xio ㊉ Xll. 

(c) Letting — [^ 2 /c-i^ 2 /c] 5 it suffices to show that the binary represen¬ 
tation of Tj n = "[yi^/ 2 ] + • • • + — i?/ 2 n】+ ?/ 2 n+i can be computed in at most 8n 

steps. Four steps are enough when n = 1 - Otherwise, letting co = ?/ 2 n+i, we can 
compute 2 ：’s bits with z/[t/ 4 /c-3?/4/c-2] + v{y A k-iyAk\ + Ck 一 i = ^{z 2 k-iZ 2 k\ + c k for 
1 < A: < [n/2\. Then S n = 2( "[之 1 么 2 】 + • • • + " 卜 n - i 之 n】）+ c n / 2 if n is even, E n = 

-- 1 】 + Zn)+C’ if n is odd，where "[|/ 2 n—l?/ 2 nl+CL n / 2 」= 2z n +C ^ 

at a cost of 4n in both cases. The remaining sum costs at most 8Ln/2」by induction. 
[See E. Demenkov ， A. Kojevnikov ， A. S. Kulikov, and G- Yaroslavtsev, Information 
Processing Letters 110 (2010), 264-267-] 

482. (a) — 1) is odd when k is odd，and it’s 士 1 when k = 1. 

(b) Adapting Sinz’s cardinality clauses as in exercises 29 and 30, we only need the 
auxiliary variables aj = s; 一 1 ， bj = s J ” and Cj = s 3 - +l ^ because s 3 - +2 = 0 and = 1. 
The clauses are then (6j Vaj+i) A (cj V6j+i) A (6j Vcj) A (aj+i V6j+i), for 1 < j < t/2 — l\ 
and (y2j-2 V aj) A (^ 2 j-i V dj V bj) A (y 2j V bj V cj) A fe+i V c^) A ( 物 -2 V cj-i) A 
V Cj-i V bj) A (y 2 j V bj V dj + i) A (t/ 2 j+i V aj+i) for 1 < j < tj2, omitting & ， c 0 , 
and the two clauses that contain yo. 
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(c) Use the construction in (b) with yj = Xjd for 1 < d < n/3 and independent 
auxiliary variables aj ， 山 bj ， d, Cj ， d. Also, assuming that n > 720， break symmetry by 
asserting the unit clause (x 72 o)- (That’s much better than simply asserting (xi).) 

This problem was shown to be satisfiable if and only if n < 1161 by B. Konev 
and A. Lisitsa [ArtiBcial Intelligence 224 (2015) ， 103—118], thereby establishing the 
case (7 = 2 of a well-known conjecture by Paul Erdos [Michigan Math. J. 4 (1957 )， 
291—300, Problem 9]. Algorithm C can prove unsatisfiability for n = 1161 in less than 
600 gigamems, using the parameters of exercise 512. 

483. Using a direct encoding as in ( 15 )，with Vjk meaning that Vj has color we can 

generate the clauses (vjk) for 1 < j < k < d and V \J\Z\ Vik) for 2 < A: < j < n. 

A similar but slightly simpler scheme works with the order encoding，when Vjk means 
that Vj has color > k. [See Ramani ， Markov ， Sakallah，and Aloul ，Journal of Artificial 
Intelligence Research 26 (2006) ， 289-322. The vertices might be ordered in such a way 
that degree(vi) > ••- > degree(i; n ), for example.] 

Those book graphs can be colored optimally with (11, 11, 13 ， 11, 10) colors, re¬ 
spectively- Such colorings are found with less than a megamem of work by Algorithm W 
or Algorithm C, without any symmetry breaking; Algorithm L also finds them, but after 
more than an order of magnitude more effort. The symmetry breaking clauses actually 
will retard this search, especially in the case of homer. On the other hand when we ask 
for only (10, 10, 12, 10, 9) colors those clauses are extremely helpful: The runtime for 
anna and david decreases from about 350 Gfi to only about 200 K/x with Algorithm C! 
For huck and jean the reduction is roughly 333 Gfi 833 M// and 14Gfi —> 4.3 Mfi] 
for homer, dozens or more of T/x go down to about 11 Gfi. (Algorithm L is hopelessly 
slow on these unsatisfiable coloring problems, even with symmetry broken.) 

484. (a) A type (iii) move will work if and only if vi —— V 2 —— V 2 —— V 3 . 

(b) For 0 < t < n — 1 we have the clause (V^i " 1 V V? 二 / 一 3 々 ,/)， as well as 
the following forl<i<j<n — t^l<k<n — t^l<l<n — t — 2: ( 办 ， /c V xt,k,k+i)] 

(st.iWxt, 1 , 1 + 3 )； here % = i + [i>k], j 1 = 

j + [j > k]^ and {〆’,/’} are the min and max of {« + [«>/ + 3] + 3[i =/], j + [j > / + 3] + 
3[j = /]}• Finally there’s a unit clause for all 1 < i < j < n with Vi -j-Vj. 

(These clauses essentially compute [G is quenchable], which is a monotone Bool¬ 
ean function of the ⑵ elements above the diagonal in the adjacency matrix of G. The 
prime implicants of this function correspond to certain spanning trees, of which there 
are respectively 1 ， 1 ， 2, 6 , 28 ， 164 ， 1137，… when n = 1 ， 2, 3, 4, 5, 6 , 7，•…) 

485. Let i! = f + 1. Instances of commutativity are: O (qt,k ， +i,qt ， ,k) H 

k < (s t ， h s t 、 v ) H (s t ^ +1 ,s t ^i) if / + 2 < {q t ,k,s t ^ v ) ^ ，仏 v ) if A: < Z’; 

e if / + 2 < A :’； (s t ,i,s t ^i) ^ (qt ， i+ 3 , s t ，“)• These can be 

broken by appending the clauses V qv,k), (4，"+i V s t ^i), ••” （办， z +3 V 〜，/)• 

Endomorphisms are also present in the two cases (qt.k^Qt^k) ^ (qt ， k+i ， qt 、 k) and 
(st^k+i^ Qt^k) ^ (% ， /c+i, 即， fc )，provided that both pairs of transitions are legal. These 
are exploited by the clauses (qt,k+i V q t ^ k V x t ,k,k+i) and (qt,k+i V s t ^ k V x t ,k+i,k+4)^ 

486. This game is a special case of graph quenching, so we can use the previous 
two exercises. Algorithm C finds a solution after about 1.2 gigamems, without the 
symmetry-breaking clauses; this time goes down to roughly 85 megamems when those 
clauses are added- Similarly, the corresponding 17-card problem after A 辜 x J 奉 is found 
to be unsatisfiable，after 15 G// without and 400 Mfi with. (A 辜 x x 10 辜 fails too.) 

Those SAT problems have respectively (1242, 20392, 60905) ， (1242, 22614, 65590 )， 
(1057 ， 15994 ， 47740) ， (1057 ， 17804 ， 51571) combinations of (variables ， clauses, cells), 
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and they are not handled easily by Algorithms A, B, D, or L. In one solution both 
仇 ii and 5o,7 are true, thus providing two ways to win(!)，when followed by gi ， i 5 , s‘ 2 , 13 , 

^3,12 5 5 4,10 5 ^5,7, q6,7, 57 , 5 , q8,5, 59 , 4 , 奶 0,5, $11 ， 3, 奶 2,3, 方 13 ， 1 ， $14 ， 1 ，奶 5 ， 1 ， ^16,1- 

Notes: This mildly addictive game is an interesting way to waste time in case 
you ever get lost with a pack of cards on a desert island- If you succeed in reducing the 
original 18 piles to a single pile, you can continue by dealing 17 more cards and trying 
to reduce the new 18 piles- And if you succeed also at that, you have 17 more cards 
for a third try, since 52 = 18 + 17 + 17 - Three consecutive wins is a Grand Slam. 

In a study of ten thousand random deals, just 4432 turned out to be winnable. 
Computer times (with symmetry breaking) varied wildly, from 1014 K/i to 37 Gfi in 
the satisfiable cases (median 220 M//) and from 46 K/x to 36 G// in the others (median 
848 M//). The most difficult winnable and unwinnable deals in this set were respectively 

9 秦 7 辜 3 辜 K<0> 7 秦 3G 2<C> 8* 6G J<0> 8 秦 2G 6 秦 4<0> 5 秦 4G 10<C> Q 秦 and 

20 90 7* 7<0> 89 K* 3<> 10* 3* 3 秦 Cj 秦 8* 2* 6 今 5 燊 . 

Students in Stanford’s graduate problem seminar investigated this game in 1989 
[see K. A. Ross and D. E. Knuth，Report STAN-CS-89-1269 (Stanford Univ., 1989 )， 
Problem 1]. Ross posed an interesting question, still unsolved: Is there a sequence of 
(say) nine “poison cards，” such that all games starting with those cards are lost? 

The classic game Idle Year is also known by many other names, including Tower of 
Babel, Tower of London, Accordion, Methuselah, and Skip Two- Albert H. Morehead 
and Geoffrey Mott-Smith，in The Complete Book of Solitaire and Patience Games 
(1949) ， 61， suggested that moves shouldn’t be too greedy- 

487. Every queen in a set of eight must attack at least 14 vacant cells* Thus \dS\ gets 
its minimum value 8 x 14 = 112 when the queens occupy the top row- Solutions to the 
8 queens problem, when queens are independent, all have |^>S| < 176. The maximum 
\dS\ is 184, achieved symmetrically for example in Fig. A—9(a). (This problem is 
not at all suitable for SAT solvers, because the graph has 728 edges- The best way 
to proceed is to run through all ( 6 g 4 ) possibilities with the revolving door Gray code 
(Algorithm 7.2.1.3R)，because incremental changes to \dS\ are easy to compute when 
a queen is deleted or inserted. The total time by that method is only 601 gigamems.) 

The maximum of |U| is obviously 64 — 8 = 56. The minimum, which corre¬ 
sponds to Turton’s question, is 45; it can be achieved symmetrically as in Fig. A—9(b )， 
leaving 64 — 8 — 45 = 11 cells unattacked (shown as black queens). In this case SAT 
solvers win: The revolving door method needs 953 gigamems，but SAT methods show 
the impossibility of 44 after only 2.2 G// of work. With symmetry reduction as in the 
following exercise, this goes down to 900 Mfi although there are 789 variables and 4234 
clauses. [Bernd Schwarzkopf, in Die Schwalbe 76 (August 1982) ， 531, computed all 
solutions of minimum |U |， given |5|，for n x n boards with n < 8. Extensions of 
Turton’s problem to larger n have been surveyed by B. Lemaire and P- Vitushinskiy 
in two articles, written in 2011 and accessible from www.ffjm.org. Optimum solutions 
for n > 16 are conjectured but not yet known.] 

All sets S of eight queens trivially have |^i n >S| = 8. 

488• Let variables Wij and bij represent the presence of white or black queens on 
cell with clauses (wij V b^jf) when (i ， j)=(〆,/) or [i,j) —— (〆，/)• Also, if each 

army is to have at least r queens，add clauses based on ( 20 ) and ( 21 ) to ensure that 
^2 Wij > r and ^2 bij > r. Optionally, add clauses based on Theorem E to ensure that 
k of the w variables for the top row are lexicographically greater than or equal to the 
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Fig. A— 9. Optimum queen placements of various kinds. 
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corresponding k variables in fifteen symmetrical variants. (For instance, if A: = 3， we 
might require W 11 W 12 W 13 > 6 in^ 2 n^ 3 n 5 thus partially breaking the symmetries.) 

The maximum army sizes for 1 < n < 11 are found to be 0, 0, 1, 2, 4, 5, 7, 9, 12, 
14, and 17, respectively. A construction with 21 armies is known for n = 12, but 22 has 
not yet been proved impossible. [B. M. Smith, K. E. Petrie, and L R Gent obtained 
similar results using CSP methods in LNCS 3011 (2004) ， 271-286.] An extra black 
queen can actually be included in the cases n = 2 ， 3 ， 4 ， 6 ， 8 ， 10， and 11. Solutions 
appear in Fig. A-9; the construction shown in Fig. A—9(d) generalizes to armies of 
2q(q + 1 ) queens whenever n = + 1 ， while those in parts (c) ，（ e) ，（ f) belong to 

another family of constructions that achieve the higher asymptotic density ^n 2 . 

When n = 8 and r = 9, Algorithm C typically finds a solution in about 10 
megamems (A: = 0 )， or about 30 megamems (k = 3); but with r = 10 it typically 
proves unsatisfiability in about 1800 M// (k = 0) or 850 Mfi (k = 3) or 550 M/x 
{k = 4) or 600 Mfi {h = 5) - Thus the symmetry breaking constraints are helpful 
for unsatisfiability in this case, but not for the easier satisfiability problem. On the 
other hand, the extra constraints do turn out to be helpful for both the satisfiable 
and unsatisfiable variants when n is larger. The “sweet spot” turns out to be A: = 6 
when n = 10 and n = 11; unsatisfiability was proved in those cases, with r = 15 and 
r = 18, after about 185 Gfi and 3500 G/i, respectively. [See Martin Gardner, Math 
Horizons 7,2 (November 1999) ， 2—16, for generalizations to coexisting armies of sizes 
r and s. F. R. K. Chung and R. L. Graham conjecture that the maximum value of 
if r = 3q 2 + 3g + 1 ， is asymptotically n 2 — (6^ + 3)n + 0(1).] 

489 • To = 1， Ti = 2, = 2T n _i + (2n — 2 )T n _2 (see Eq. 5-1. 4 -( 40 )). The generating 

function T n z n 丨 n\ and the asymptotic value are given in exercise 5-1.4-31. 

490. Yes. For example, using the signed permutation 4132, we’re allowed to assume 
that some solution satisfies X 4 X 1 X 3 X 2 < x\x 1 X^X 2 for every endomorphism —— because 
the solution with lexicographically smallest xax\x^X 2 has this property. Notice that 
the signed permutation 12 ... n converts ‘<’ to ‘ 

491. Let (T be the permutation (12 3 412 3 4). Then a 4 = (1 1)(22)(33)(44); and by 
Theorem E we need only search for solutions that satisfy xix‘ 2 尤 3 工 4 S X 1 X 2 X 3 X 4 - We^re 
therefore allowed to append the clause (xi) without affecting satisfiability. 

(We actually are allowed to assert that x\ = X 2 = xa = 0, because 0000 and 0010 
are the lex-leaders of the two 8 -cycles when a is a written as a permutation of states.) 

In general if an automorphism is a permutation of literals having a cycle that 
contains both v and for some variable we can simplify the problem by assigning 
a fixed value to v and then by restricting consideration to automorphisms that don’t 
change v. (See the discussion of Sims tables in Section 7.2.1.2.) 

492 • Suppose xi .. .x n satisfies all clauses of F; we want to prove that ... x n )r = 
x [… x n also satisfies them all. And that’s easy: If (Zi V … V 4) is a clause, we have 
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l[ = ZiT ，• • • ， l’ n = l n r\ and we know that (hr V • • • V 4r) is true because it’s subsumed 
by a clause of F. [See S. Szeider, Discrete Applied Math. 130 (2003), 351—365 」 

493. Using the global ordering pi.. .p 9 = 543219876 and Corollary E, we can add 

clauses to assert that X 5 = 0 and < xqxtx^xq. A contradiction quickly fol¬ 

lows, even if we stipulate only the weaker relation xa < because that forces xq = 1 . 

494. Exercise 504(b) shows that {uv){uv) is a symmetry of the underlying Boolean 
function, although not necessarily of the clauses F. [This observation is due to Aloul, 
Ramani ， Markov，and Sakallah in the cited paper.] The other symmetries allow us to 
assert (i) (xi V Xj) A (xj V Xk)^ (ii) (xi V Xj) A (xj V Xk)^ (iii) (xi V Xj) A (xj V Xk)- 

495. Suppose, for example, that m = 3 and n = 4. The variables can then be called 
11, 12 ， 13, 14, 21, … ， 34; and we can give them the global ordering 11, 12, 21, 13, 22, 
31, 14, 23, 32, 24, 33, 34. To assert that 21 22 23 24 < 31 32 33 34, we use the involution 
that swaps rows 2 and 3; this involution is (21 31) (22 32) (23 33) (24 34) when expressed 
in form ( 192 ) with signs suppressed. Similarly we can assert that 12 22 13 < 13 23 33 
because of the involution (12 13)(22 23)(32 33) that swaps columns 2 and 3. The same 
argument works for any adjacent rows or columns. And we can replace by by 
complementing all variables. 

For general m and n, consider any global ordering for which Xij precedes or equals 
Xj/y when 1 < i < % < m and 1 < j < / < n. The operation of swapping adjacent 
rows makes the global lexicographic order increase if and only if it makes the upper 
row increase lexicographically; and the same holds for columns- 

[See Ilya Shlyakhter, Discrete Applied Mathematics 155 (2007) ， 1539-1548.] 

496. No; that reasoning would “prove” that m pigeons cannot fit into m holes- The 
fallacy is that his orderings on rows and columns aren’t simultaneously consistent with 
a single global ordering, as in the previous exercise. 

497. A BDD with 71,719 nodes makes it easy to calculate the total ， 818,230,288,201 ， 
as well as the generating function 1 + z + 3z 2 + 8z 3 + 25z 4 + • • • + 21472125415z 24 + 
31108610146^ 25 + ... + 1026872113U 39 + 6 1 52 8 3 65 1 8 之 4Q + ... + 24z 60 + 8z 61 + 3z 62 -h 
z 63 -hz 64 . (The relatively small coefficients of z 39 and z 40 help account for the fact that 
> was chosen in ( 185 )-( 186 ); problems with sparse solutions tend to favor >.) 

[Polyaks theorem in Section 7.2.3 shows that exactly 14,685,630,688 inequivalent 
matrices exist; compare this to 2 64 ^ 1.8447 x 10 19 without any symmetry reduction.] 

498. Consider the global ordering xoi ， xn^ •… ， x m i ； X 12 , 尤 22 , •… ， x m 2 ^ X 02 ] 尤 ‘ 23 , ^ 33 , 

• •” X m3 , X 03 , X 13 ； •… ； Xmm, x 0 m, ••” There’s a column symmetry 

that fixes all elements preceding 工 )" and takes 

499. No. The unusual global ordering in answer 498 is not consistent with ordinary 
lexicographic row or column ordering. [Nor can the analogous clauses (xu V Xij) for 
1 < i < m and i < j < n be appended to ( 185 ) and ( 186 ) • No quad-free matrix for 
m = n = 4 and r = 9 satisfies all those constraints simultaneously-] 

500. If Fo has a solution, then it has a solution for which l is true. But (Fo U Fi) | / 

might be unsolvable. (For example, let Fo = (xi V X 2 ) A ( 无 2 V xi), which has the 

symmetry 12; so we can take S = (xi)^ l = x\. Combine that with F\ = (xi).) 

501- Let Xij denote a queen in cell (i ， j)，for 1 < i < m and 1 < j < n- Also 

let rij = [xn + • • • + 2 1] and r[j = [xn + • • • + A(j+i) 2 2]，for 1 < i < m and 

1 < j < n. Using ( 18 ) and ( 19 ) we can easily construct about 8mn clauses that 
define the r’s in terms of the x^s and also ensure that xn + • • • + Xi n < 2. Thus 
= [xn H - h Xi n =2]; call this condition 
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Similar conditions Cj, and bd are readily established for column j, and for the 
diagonals with i+j = d+1 or i—j = d_n, for 1 < i < m, 1 < j < n, and 1 < d < m+n. 
Then condition (ii) corresponds to the ran clauses (xij V V Cj V a^j-i V bi 一 
Finally we have clauses from ( 20 ) and ( 21 ) to ensure that Xij < r. 

When m = n, the lower bound r > n — [n mod4 = 3] has been established by 
A. S. Cooper, O. Pikhurko, J. R. Schmitt, and G. S. Warrington [AMM 121 (2014), 
213—221], who also used backtracking to show that r > 12 on an 11 x 11 board. SAT 
methods, with symmetry breaking, yield that result much more quickly (after about 9 
teramems of computation); but this problem，like the tomography problem of Fig. 36, 
is best solved by integer programming techniques when m and n are large. 

If we call the upper left corner white，solutions with m = n = r — 1 and all queens 
on white squares appear to exist for all n > 2， and they are found almost instantly- 
However，no general pattern is apparent- In fact，when n is odd it appears possible to 
insist that the queens all appear in odd-numbered rows and in odd-numbered columns- 
Here are examples of optimum placements on smallish boards- The solutions for 
8 x 9, 8 x 10, 8 x 13, 10 x 10, and 12 x 12 also work for sizes 8 x 8, 9 x 10, 8 x 12, 9 x 9, 
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This placement of ten queens on a 10 x 10 board can be described by the “magic 
sequence” （ ai ， … ， a 5 ) = (1 ， 3, 7, 5, 9), because the queens appear in positions (a“ a$+i) 
and (ai+i ， a《）for 1 < i < n/2 as well as in (ai ， ai) and (a n / 2 ,a n / 2 ) - The magic 
sequences (1, 3, 9, 13 「 15, 5, 11, 7, 17) and (9, 3, 1, 19, 5, 11, 15, 25, 7, 21, 23, 13, 17) 
likewise describe optimum placements for n = 18 and 26. No other magic sequences 
are known; none exist when n = 34. 

502. For each j, construct independent cardinality constraints for the relation x[^ + 

…+ < rj, using say ( 20 ) and ( 21 ), where = (Sjk? Xk)- 

503. The Hamming distance d(x,y) = ㊉ y) between binary vectors of length n 

satisfies d(x^ y) + d(x^ y) = n. Thus there is no x with d(x, Sj) > rj + 1 for all j if and 
only if there is no x with d(x^Sj) < n — 1 — rj for all j. [See M. Karpovsky ， IEEE 

Transactions IT-27 (1981), 462-472.] 

504. (a) Assume that n > 4. For strings of length 2n we have d(z, w) + d(z, w) = 2n; 

hence d(z^w) < n and d(z^ w) < n if and only if d(z ， w) = d(z^ w) = n. Every string 
% with Z 2 k-i 7 ^ Z 2 k for 1 < A: < n satisfies d(z ， Wj) = n for 1 < j < n- Conversely, if 
d(z,Wj) = d(z ， Wk) = n and 1 < j < A: < then Z 2 j-i + Z 2 j = z^k-i + ^ 2 /c- Thus if 
Z 2 j-i = Z 2 j for some j we have 2： = 00 • • • 0 or 11 • • • 1 , contradicting d(z, wi) = n. 

(b) For each string x = X 1 X 1 X 2 X 2 - - - x n x n that satisfies part (a) we have d(x^ y )= 

2li + 2h + 2『3 + n — 3, which is < n + 1 if and only if (/ 1 V /2 V I 3 ) is satisfied. 

(c) Let Sj = Wj and rj = n for 1 < j < 2n; let s‘ 2 n +/c = Vk and r^n-vk = n + 1 

for 1 < A: < where yk is the string in (b) for the kth clause of F. This system 

has a closest string x = ^ 1 X 1 X 23:2 •… 无 if and only ii xi ... x n satisfies every clause. 
[A similar construction in which all strings have length 2n + 1 and all rj are equal 
to n + 1 is obtained if we append the bit [n<j< 2n] to each Sj• See M. Frances and 
A. Litman, Theory of Computing Systems 30 (1997), 113-119.] 
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(d) Boilerplate 11000000, 00110000, 00001100, 00000011, 00111111, 11001111, 
11110011, 00000011, at distance < 4; for the clauses, 01011000, 00010110, 01000101, 
10010001, 10100100, 00101001, 10001010, and possibly 01100010, at distance < 5. 

505. (For A: = 0 ， 1, ••” n — 1 one can set j to a uniform integer in [0 . .k] and 
INX Ik + 1] j; also if j = k set VAR [A:] A: + 1， otherwise i VAR[j] ， VAR[A:] i i ， 
INX[i] ^r- VAR [j] A: + 1.) With nine random seeds, typical runtimes for D3 are 
(1241, 873, 206, 15, 748, 1641, 1079, 485, 3321) M". They’re much less variable for the 
unsatisfiable K0, namely (1327, 1349, 1334, 1330, 1349, 1322, 1336, 1330, 1317) M"; 
and even for the satisfiable W2: (172, 192, 171, 174, 194, 172, 172, 170, 171) Mfi. 

506. (a) Almost true: That sum is the total number of clauses of length > 2, because 
every such clause of length k contributes 1/(^) to the weights of ⑵ edges- 

(b) Each of the 12 2 — 2 = 142 cells of the mutilated 12 x 12 board contributes 
one positive clause (vi V • • • V Vk) and (g) negative clauses {vi V 巧 )， when that cell can 
be covered by k potential dominoes { 仍 ， • • •, v/c}. So the weight between u and v is 2, 
4/3, or 7/6 when dominoes u and v overlap in a cell that can be covered in 2, 3, or 4 
ways- Exactly 6 cells can be covered in just 2 ways (and exactly 10 2 in 4 ways)- 

(The largest edge weights in all of Fig. 52 are 37/6, between 20 pairs of vertices 
in K6- At the other extreme, 95106 of the 213064 edges in X3 have the tiny weight 
1/8646， and 200904 of them have weight at most twice that much.) 

507. Consider, for example, the clauses (u V i), (v W t)^ (u\/ v W t)^ {u V (v V F), 
(u\/ v\/ t f ) from (24). Looking ahead from t = 1 yields the windfall (f V 〆），and looking 
ahead from t r =1 yields (? V t). Henceforth Algorithm L knows that t equals t r . 

508. According to (194)，the purging parameters were A p = 1000 and = 500; thus 
we have learned approximately 1000A: + 500(g) clauses when doing the kth. purging 

phase. After 1000L clauses this works out to be ^ (\/16L + 9 — 3)/2 phases，which is 
s 34.5 when L = 323 - (And the actual number was indeed 34.) 

509. One remedy for overfitting is to select training examples at random. In this case 
such randomness is already inherent，because of the different seeds used while training. 

510. (a) From Fig. 53 or Fig. 54 or Table 7 we know that T1 < T2 < L6 in the median 
rankings; thus T2 obscures L6 and Tl. 

(b) Similarly, L8 < M3 < Q2 < X6 < F2 < X4 < X5; X6 obscures L8 and X4. 

(c) X7 obscures K0, K2, and (indirectly) A2, because K2 obscures K0 and A2. 

511. (a) Nine random runs finished in only (4.9, 5.0, 5.1, 5.1, 5.2, 5.2, 5.3, 5.4, 5.5) 

(b) Nine random runs now each were aborted after a teramem of trials. (No theo¬ 
retical explanation for this discrepancy, or for the wildness of P4 in Fig. 54, is known.) 

(c) (0.2, … ， 0.5, • • • ， 3.2) M/i without; (0.3, … ， 0.5, • • • ， 0.7) M" with. 

512. A training run with ParamILS in 2015 suggested the parameters 

a = 0.7, p = 0.998, g = 0.99995, A p = 100000, = 2000, 

r = 10, w = l, p = 0, P = 0.05, 0 = 0.166667 ， (*) 


which produce the excellent results in Fig. A-10. 

513. After training on rand {3, 1062, 250, 314159)，ParamILS choose the values a = 3-5 
and © = 20-0 in (195)，together with distinctly different values that favor double 
lookahead，namely /5 = .9995, Y = 32. [The untuned values a = 3.3, p = .9985, 
© = 25.0, and Y = 8 had been used by the author when preparing exercise 173.] 
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514. ParamILS suggests p = -85 and N = 5000n; that gives a median time ^ 690 Mp. 
(But those parameters give horrifically bad results on most other problems.) 


515. Use variables S“k meaning that cell (i ， j) in the solution holds A: ， and Zij meaning 
that cell (i^j) is blank in the puzzle- The 729 S variables are constrained by 4 x 81 x 
(1 + ( 2 )) = 11,988 clauses like ( 13 ) - From condition (i)，we need only 41 variables Zij. 
Condition (ii) calls for 15 clauses such as (Z 11 V. • -VZig), (Zn V- - -V^i VZ 49 V- - .VZi 9 )， 
(Z 15 V • • • V Z 55 ), (Z 44 V Z 45 V Z 46 V Z 54 V Z 55 ), when equal Z^s are identified via (i)- 
Condition (iii) ， similarly，calls for 28 clauses such as (Zu V Z 12 V 艺 13 )， (^n V Z 21 V ^ 31 )， 
(Z 45 V Z 55 ). Condition (vi) is enforced by 34,992 clauses epitomized by (Sm V Zu V 
S 122 V Z \2 V *^412 V -Z 4 I V S 42 I V Z/^ 2 ) - 


For conditions (iv) and (v)，we introduce auxiliary variables Vijk = Sijk A Zij ^ 
meaning that k is visible in (i ， j); Rik — Vnk V • • • V Vi^k^ meaning that k is visible 
in row i; Cjk = Vijk V • • • V Vgj/c, meaning that k is visible in column j. Also B^k = 
V 〈 i ， j 〉 = 6 ^ijk^ meaning that k is visible in box 6 ; here 〈 i ， j〉= l+3[(i —l)/3」+ L(j —1)/3」. 
Then = Zij A A Cjk A means that A: is a possible way to fill cell (i, j) 

without conflict- These 1701 auxiliary variables are defined with 8262 clauses- 


Condition (iv) is enforced by nine 9-ary clauses for each i and j, stating that we 
mustn’t have exactly one of {Piji^ … ， -P^g} true. Condition (v) is similar，enforced by 
three sets of 81 x 9 clauses of length 9; for example, one of those clauses is 


(P417 V P427 V P437 V P517 V P527 V P537 V P 617 V P &27 V ^637)- 
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(“We aren’t obviously forced to put 7 into box 4 by using cell (5, 2 )?’） 

Finally, some of the symmetry is usefully broken by asserting the unary clauses 
SikkNZ\\/\Z\ 2 . The grand total is 58,212 clauses with 351,432 cells, on 2,471 variables. 

(This problem was suggested by Daniel Kroening. There are zillions of solutions ， 
and about one in every five or six appears to be completable uniquely to the setting 
of the S variables. Thus we can obtain as many “hard sudoku” puzzles as we like, by 
adding additional unary clauses such as 5553 八之 17 more or less at random 5 then weeding 
out ambiguous cases via dancing links. The clauses are readily handled by Algorithms 
L or C, but they’re often too difficult for Algorithm D. That algorithm did, however, 
find the uniquely completable solution (a) below after only 9.3 gigamems of work.) 

If we beef up condition (iii), insisting now that no box contains a row or column 
with more than one blank, condition (vi) becomes superfluous- We get solutions such 
as (b) below, remarkable for having no forced moves in spite of 58 visible dues, yet 
uniquely completable- That puzzle is, however, quite easy; only 2, 4, 7 are unplaced- 


1 . . . . 6.8. 

1.3.56.89 

1 .3.5.7.. 

1.3.56.89 

5.87214.6 

59738.61. 

. 5.79•••1 

68.3.91.5 

.6.38.2.1 

68.1.93.5 

7....125. 

.9518.63. 

84.••3••5 

956.318.7 

..1..5.76 

3.896..51 

..5.6.8.. 

.315.896. 

..5.7.1.. 

.195.836. 

6..8...42 

2.896.153 

47.1..5.. 

56..319.8 

3.6.48.2. 

8.96.5.31 

.185••••7 

.56.9381. 

4.76321.8 

•65.13298 

5... 87.1. 

8.16.5.93 

.8.5....4 

31.89.5.6 

..7.1.8.5 

93.81.5.6 

(a) 

(b) 

(C) 

(d) 


We might also try to strengthen conditions (iv) and (v) by requiring at least three ways 
to make each choice, not just two- Then we get solutions like (c) above- Unfortunately ， 
however, that one is completable in 1237 ways! Even if we also strengthen condition (iii) 
as in (b)，we get solutions like (d)，which can be completed in 12 ways. No uniquely 
completable sudoku puzzles are known to have such ubiquitous threefold ambiguity. 

516. This conjecture can be expressed in several equivalent forms. R. Impagliazzo and 
R. Paturi [JCSS 62 (2001) ， 367-375] defined Sk = inf {lg r | there exists an algorithm 
to solve /cSAT in r n steps}，and stated the exponential time hypothesis: S 3 > 0. They 
also defined Soo = lim/e-^oo Sk^ and proved that 外 < (1 — d/k)soo for some positive 
constant d. They conjectured that 5oo = 1 ； this is the strong exponential time 
hypothesis. An alternative formulation [C- Calabro, R. Impagliazzo, and R. Paturi, 
IEEE Conf. on Computational Complexity 21 (2006) ， 252-260] was found later: “If 
t < 2, there is a constant a such that no randomized algorithm can solve every SAT 
problem with < an clauses in fewer than r n steps, where n is the number of variables.” 

517. (a) If there are n variables, introduce ( 2 ^) new variables ll f = 1% one for each 
pair of literals {/，/’}, with the equations "’ + /『’ + f = 1. Similarly, introduce ( 2 :) 
variables H’Z"，via II’ l’’ + ll f l n + IV + T = 1 - Then the ordinary ternary clause l\! I’ M l" 
is true if and only if we have ll'l" + ll'l" -1- ll'l" + ll'l" + ll'l" + ll'T" + llT = 1. 

(b) Remove clauses of length > 3 by using the fact that Zi + • • • + 4 = 1 if and 

only if h H - + /j +1 = 1 and 匕 +i H - h //c +1 = 1, where t is a new variable. Also, 

if a, 6, c ， and d are new variables with a + 6 + d = a + c + d = 1， beef up short clauses 
using / + /’ = 1 / + /’ + a = 1 and l = 1 [ + 6 + c = 1. 
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[Thomas J- Schaefer proved the NP-completeness of 1 - in-3 SAT as a special case 
of considerably more general results，in STOC 10 (1978), 216-226.] 


518. (a) A = where x = 工 0 , y 

(b) Twice in the n variable rows and n variable columns; once in the 3m output 
rows and 3m input columns; never in the 3m input rows and 3m output columns. 

(c) By (a)，each way to choose 2s in different rows and columns contributes zero 
to the permanent unless, in every clause, the subset of chosen inputs is nonempty and 
matches the chosen outputs. In the latter case it contributes 16 m 2 n . [See A. Ben-Dor 
and S- Halevi, Israel Symp. Theory of Computing Systems 2 (IEEE, 1993), 108-117.] 


-10 
1 o 


519. The unsatisfiable problem corresponding to D1 and D2 has median running time 
2099 (losing to both factor_fifo and factorJifo). The satisfiable one corresponding 
to D3 and D4 is unstable (as in Fig. 54)，with median 903 M/i (winning over both). 

520. (Solution by Sven Mallach ， 2015, using solvers X and Y ， where X was CPLEX 12.6 
and Y was GUROBI 6, both used with emphasis on mixed-integer-program feasibility, 
constant objective function, and solution limit 1.) With a time cutoff of 30 minutes on a 
single-threaded Xeon computer, neither X nor Y could solve any of the 46 problems Al ， 
A2, Cl, C2, C3, C4, C5, C6, C8, Dl, D2, El, E2, FI, F2, Gl, G2, G5, G6, G7, G8, K7, 
K8, M5, M7, M8, Ol, 02, PO, PI, P2, Q7, S3, S4, T5, T6, T7, T8, W2, W4, XI, X3, 
X5, X6, X7, X8. (In particular, this list includes PO, S4, and XI, which are extremely 
easy for Algorithm C.) On the other hand both X and Y solved the langford problems 
L3 and L4 — which were the toughest for Algorithm C — in less than a second. 

Algorithm C performs about 20 Gfi per minute on a comparable Xeon. In these 
experiments it significantly outperformed the geometric methods except on problems 
KO, Kl, K2, L3, L4, and P4 (and some easy cases such as B2). 

Of course we must keep in mind that the particular clauses in Table 6 aren’t 
necessarily the best ways to solve the corresponding combinatorial problems with an 
IP solver, just as they aren’t necessarily the best encodings for a SAT solver. We are 
comparing here only black-box clause-solving speeds. 


521. A variety of simple schemes has been surveyed by S. Jabbour ， J. Lonlac, L. Sai.s ， 
and Y. Salhi ， arXiv: 1402.1956 [cs.AI] (2014)，13 pages. 

522. For cycles of length T we can introduce 27T variables xyzt for 1 < z < 3 and 
0 <t <T^ signifying that vertex z) occupies slot t in the path. Binary exclusion 
clauses -^xyzt V ， x’y z’ t ，, when xyz = x y z and t ★ t, or when xyz ^ xyz and t = 〆 ， 
ensure that no vertex appears twice in the path, and that no two vertices occupy the 
same slot. A valid path is specified via the adjacency clauses 


xyz t 


V V {xyz[ t+1)mod T I 1 < x 7 , y\ z' <3 and \x -x\ + \y -y\ + \z - zj = l}- 


We represent the shadows by introducing 36 variables a!6*, bah, a!*6, 6*a! ， *a!6 ， *bal 
for 1 < a < 2 and 1 < 6 < 3; here al*b (for example) means that the shadow of 
(x^ z) coordinates has a transition between (a ， 6) and (a+l ， 6). These variables appear 
in ternary clauses such as {-^xyzt V V x\*z) A {-^xyzt V -y{x+l)yz t > V xly*) 

whenever x < 3 and t f 三 t 士 1 (modulo T). To exclude loops we append clauses like 


-nl!l* V -i2!l* V -i31!* V i32!* V ， 2!3* V ， 22!* V -il!2* V -ill!*; 


this one excludes the loop in the example illustration. There are 39 such loop-defeating 
clauses, one for each of the 13 simple cycles in each shadow- 
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Finally we can break symmetry by asserting the unary clauses lllo, 112i 

without loss of generality, after verifying that no solution can avoid all eight corners- 

Clearly T must be an even number, because the graph is bipartite; also T < 27. 
If the method of exercise 12 is used for the exclusions, we obtain a total of 6264 clauses, 
822 variables, and 17439 cells when T = 16; there are 9456 clauses, 1242 variables, and 
26199 cells when T = 24. These clauses are too difficult for Algorithm D. But Algo¬ 
rithm L resolves them almost instantaneously for any given T; they turn out to be satis- 
fiable if and only if T = 24, and in that case there are two essentially different solutions. 
One of these cycles, due to John Rickard (who introduced this problem at Cambridge 
University，circa 1990), is beautifully symmetric, and it is illustrated on the cover of 
Peter Winkler^s book Mathematical Mind-Benders (2007). It can be represented by the 
delta sequence (322313312112322313312112), where ‘A:’ and c k^ change coordinate k by 
+1 or —1. The other is unsymmetric and represented by (332121331221232311312132)• 

523. (Solution by Peter Winkler.) With coordinates y^z)forl<x<m^l<y<n^ 
1 < 2; < 2, any cycle with loopless shadows must contain at least two steps (x, y, 1) —— 

and y\ 1 ) —— {x ^ y f ^2). We can assume that x < x and that x — x is 
minimum. The m x 2 shadow contains (x, 1) —— (x^ 2) and (x\ 1) —— (x\ 2)，together 

with (say) the path (x，1) - ($’，1)，but without the edge 2) —— （x"+l，2) for 

some x" with x < x r, < x . The unique shortest path from (x,y) to (x\ y) in the mxn 
shadow contains some edge (x",y n ) —— (x"+l，y"); hence (x 〃， y'l) —— (t"+ 1，""，1) 
must occur twice in the cycle. 

524. This problem involves clauses very much like those for a cyclic path, but simpler; 
we have T = 27 and no “wrap-around” conditions. With typically 1413 variables，10410 
clauses, and 28701 cells, Algorithm L shines again, needing only a gigamem or two to 
handle each of several cases that break symmetry based on starting and ending points. 
There are four essentially different solutions，each of which can be assumed to start 
at 111; one ends at 333, another at 133， another at 113， and the other at 223. Using 
the delta sequence notation above, they are: 33233233133233233133233233 (which is 
reflected ternary code); 31313311211331313231313311; 32323132323132323322122122; 
11221121312112211311221121. 

[Such paths，and more generally spanning trees that have loopless shadows, were 
invented in 1983 by Oskar van Deventer, who called them “hollow mazes”； see The 
Mathemagician and Pied Puzzler (1999)，213-218. His Mysterians puzzle is based on 
an amazing Hamiltonian path on P5 □ P5 □ P5 that has loopless shadows.] 

525. The author’s best solution，as of July 2015 ， had 100 variables, 400 clauses, and 
1200 literals (cells); it was derived from Tseytin’s examples of exercise 245, applied to a 
more-or-less random 4- regular graph of girth 6 on 50 vertices. Tseytin’s construction, 
with one odd vertex and 49 even ones, yields 400 clauses of 4SAT, which are quite 
challenging indeed. It can be simplified to a 3SAT problem by insisting further that 
every even vertex must have degree exactly 2 in the subgraph specified by true edges. 
(See K. Markstrom, J. Satisfiability, Boolean Modeling and Comp. 2 (2006) ， 221-227). 

That simplified problem still turned out to be fairly challenging: It was proved 
unsatisfiable by Algorithm L in 3.3 Tfi and by Algorithm C in 1.9 Tfi. (But by applying 
the endomorphisms of exercise 473， which broke symmetry by adding 142 clauses of 
length 6, the running time went down to just 263 M// and 949 M//，respectively-) 

Another class of small - yet-difficult problems is worth mentioning, although it 
doesn’t fit the specifications of this exercise [see L Spence, ACM J. Experimental Algo- 
rithmics 20 (2015)，1-4:1-1.4:14]: Every instance of 3D matching whose representation 
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as an exact cover problem has 5n rows and 3n columns，with five Is in each column and 
three Is in each row, can be represented as a SAT problem in 3n variables, lOn binary 
clauses, and 2n quinary clauses, hence only 30n total literals- This 5SAT problem has 
the same number of literals as the 3SAT problem discussed above, when n = 40; yet it 
is considerably more difficult if the matching problem is unsatisfiable. (On the other 
hand, the problem of this kind that defeated all the SAT solvers in the 2014 competition 
corresponds to a matching problem that is solved almost instantaneously by the dancing 
links method: Algorithm 7-2*2.ID needs less than 60 Mfj, to prove it unsatisfiable.) 

526. We prove by induction on |F| that it’s possible to leave at most w(F) clauses 
unsatisfied, where w(F) = 2 一 l c l: If all clauses of the multiset F are empty we 

have w(F) = \F\^ and the result holds- Otherwise suppose the variable x appears in F. 
Let l = x if w({C \ x E C E F}) > w({C | x G C G F}); otherwise l = x. A simple 
calculation shows that w(F\ l) < w{F). [JCSS 9 (1974)，256—278，Theorem 3-] 

999 - ... 
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GOLDSMITH 


The republic of letters is at present divided into three classes. 
One writer, for instance, excels at a plan or a title-page, 

another works away the body of the book, 

and a third is a dab at an index. 

— OLIVER GOLDSMITH, in The Bee (1759) 

When an index entry refers to a page containing a relevant exercise, see also the answer to 
that exercise for further information. An answer page is not indexed here unless it refers to a 
topic not included in the statement of the exercise. 


dS (boundary set), 58, 154 ， 180 ， 188. 

0-1 matrices ， 106—109, 151 ， 176—177, 181 ， 
see also Grid patterns. 

1SAT ， 49, 148. 

2- colorability of hypergraphs, 185. 

2SAT, 49, 51—54, 77— 78, 80, 101 ， 144 ， 

147, 149, 157, 159, 266- 

3- regular graphs, 147, 154, 231. 

3CNF ， 3, 148. 

3D MATCHING problem, 134, 225, 290-291. 
3D visualizations, 116-118. 

3SAT, 3—4, 47— 51 ， 59, 60, 78—80, 93—94, 131 ， 
135, 146, 148—151 ， 153, 182—184, 231. 

4- cycles, 109—110, 178, 225, 274. 

4-regular graphs, 290- 

4SAT ， 49, 51 ， 150, 290- 
5SAT ， 51 ， 58, 224, 291. 

6SAT ， 51. 

7SAT ， 51 ， 151. 

8 queens problem, 25, 282. 

90°-rotational symmetry, 138, 202, 275. 

100 test cases, 113-124, 127, 182, 184. 

0 (the empty set), 185. 
e (the empty clause), 3, 27 ， 185, 291- 
e (the empty string), 3, 85. 
e (the tolerance for convergence), 93-94. 

£ (offset in heuristic scores), 126, 213- 
vx (Is count), see Sideways sum. 

7r (circle ratio), see Pi- 
p (damping factor for variable activity )， 

67, 125—127, 155, 286. 

p (damping factor for reinforcement), 93—94. 
p (damping factor for clause activity), 

74, 125-127, 286. 
r parameter, 125—127, 235, 286. 
r(a, 6) function, 147- 
(f) (golden ratio) ， 146, 147, 160, 251- 
^ (agility threshold), 76—77, 124—127 ， 

240, 286. 

^ (confidence level), 93, 255. 

a.s.: almost surely, 149, 153- 
AAAI: American Association for Artificial 
Intelligence (founded in 1979); 


Association for the Advancement of 
Artificial Intelligence (since 2007), 67. 
Absorbed clauses, 168. 

Accordion solitaire, 282• 

Achlioptas, Dimitris 
Ar][ir\xpr]Q), 221. 

ACT(c), 74, 125. 

ACT(/c) ， 66- 68, 75, 125, 132. 

Active path, 13. 

Active ring, 32. 

Activity scores, 67, 74-76, 125, 132, 

155, 239- 

Acyclic orientation, 161- 
Ad ams, Douglas Noel (42), 126- 
Ad apt ive control, 46, 126. 

Addition, encoding of, 100—101, 114; see 
also Full adders, Half adders. 
Adjacency matrix, 281. 

Adjacent pairs of letters, avoiding, 248- 
AGILITY, 76, 158, 240. 

Agility level 5 76, 124, 158. 

Agility threshold ( 岭）， 76-77, 124—127 ， 

240, 286. 

Ahmed, Tanbir ( 万 现兑 ㈣ ) ， 5, 147- 
Alava, Mikko Juhani, 80. 

Aldous, David John, 219. 

Algorithm L Q ， 39 ， 147. 

Alice ， 20-24, 139-141. 

All-different constraint, 171. 

All solutions, 143, 266. 

Alon, Noga nro), 174, 254, 260. 

Aloul，Fadi Ahmed (J^iLaJI … U), 

112, 281, 284. “ 

Analysis of algorithms, 146—152, 

158-160, 164. 

Ancestors, 43. 

AND operation, 9 ， 10, 13. 

bitwise (: r & ?/) ， 28, 29, 31 ， 37, 38, 66, 68, 
76, 81 ， 196, 209—211 ， 220, 241. 

Andre, Pascal, 131. 

Anisimov, Anatoly Vasilievich (Ahhchmob, 
AHaTOJiHH BacMjrbeBm), 249- 
Annexstein, Fred Saul ， 274. 

Anti-maximal - element clauses, 56, 62, 97, 
115, 153, 155, 157, 167. 
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Antisymmetry ， 178- 
Appier dit Hanzelet, Jean, 57. 

April Fool ， 7. 

Ardila Mantilla, Federico, 256. 

Arithmetic progressions, 4, 114. 

avoiding, 135. 

Armies of queens, 180. 

Asm Acha, Roberto Javier, 267. 

Asserting clause, see Forcing clause. 
Associative block design, 4. 

Associative law, 227- 
Asymmetric Boolean functions, 178- 
Asymmetric elimination, 260. 

Asymmetric tautology, see Certifiable 
clauses. 

Asymptotic methods, 53—54, 147—151, 

164, 210, 226, 230, 283. 

At-least - one constraint, 171, 265- 
At-most-one constraint, 6, 97-99, 103, 

104, 120, 134, 149, 170, 171 ， 238, 

265, 266, 289. 

ATPG: Automatic test pattern generation, 
see Fault testing. 

Atserias, Albert Peri, 262. 

Audemard, Gilles, 72. 

Aurifeuille, Leon Frangois Antoine, 
factors, 14. 

Autarkies ， 44, 71, 146, 152, 177, 214, 

215, 217- 

testing for, 146, 214. 

Autarky principle, 44. 

Automatic test pattern generation, see 
Fault testing. 

Automaton, 272. 

Automorphisms, 108, 111, 180, 197, 

236, 277. 

Autosifting, 220. 

Auxiliary variables, 6, 8 ， 15, 17, 60, 97, 101 ， 
104, 105, 109, 135, 136, 148, 170-174 ， 
186, 262, 268, 276-279, 280-281, 287- 
AVAIL stack, 257. 

Averages, 120. 

Avoiding submatrices, 106-107. 

Awkward trees, 227. 

Axiom clauses, 54, 59, 100, 264, 266. 

Bacchus, Fahiem, 73, 271- 
Backjumping ， 64, 68, 74, 132, 233, 236, 239. 
Backtrack trees, see Search trees. 
Backtrackings 4, 27-34, 38-39, 64, 105, 

128, 129, 132, 151 ， 176, 190, 204, 

219, 231, 236. 

Bailleux ， Olivier, 8, 26, 135, 137, 143 ， 

174, 272. 

Baker, Andrew Baer, 98. 

Balas ， Egon, 206- 

Baldassi, Carlo, 93- 

Ball, Walter William Rouse, 180. 

Ballot numbers, 78. 


Balls and urns, 221. 

Banbara，Mutsunori ( 番原睦則）， 264 ， 

267, 268. 

Bartley, William Warren, III, 129. 

Basket weavers, 141. 

Batcher, Kenneth Edward, 266. 

Baumert, Leonard Daniel, 265- 
Bay ardo, Roberto Xavier, Jr. ， 132. 

Bayes, Thomas, networks, 95. 

BCP: Boolean constraint propagation, 
see Unit propagation. 

BDD: A reduced, ordered binary decision 
diagram ， 17-18, 102, 103, 132, 137, 148 
174, 181 ， 188, 193, 194, 197, 202, 220. 
BDD base, 219. 

Belief propagation, 95- 

Ben-Dor, Amir (m-p I’DN) ， 289. 

Ben-Sasson，Eli (p\y\y-p 57—58, 

153, 231. 

Benchmark tests ， 35 ， 131-133 ， 139 ， 

147, 190, 206. 

100 test cases ， 113-124, 127, 182, 184. 
Bender, Edward Anton, 250- 
Beresin, May, 275. 

Berghammer, Rudolf ， 204. 

BerkMin solver, 132. 

Berlekamp, Elwyn Ralph, 17- 
Berman, Piotr, 224. 

Bernhart, Frank Reiff, 188. 

Bernoulli, Jacques (= Jakob = James), 
distribution, multivariate, 89. 

Bethe, Hans Albrecht, 95- 
Better reasons, 157- 
Bias messages, 92. 

Biased random bits, 12, 241. 

Biere, Armin, v, 66, 76, 96, 129, 132, 166, 
188, 258, 260, 261 ， 269, 280. 

Big clauses, 145. 

BIMP tables, 36- 41 ， 43, 45, 124, 144, 235- 
Binary addition, 114. 

Binary clauses, 3, 6, 36, 124, 133, 155-156. 
Binary constraints, 171. 

Binary decoder, 179. 

Binary implication graph, see Dependency 
digraph, 41. 

Binary matrices ， 106-109, 151 ， 176-177, 

181, see also Grid patterns. 

Binary multiplication, 8. 

Binary number system, 9, 98- 
Binary recurrence relations, 189. 

Binary relations, 56. 

Binary search, 187. 

Binary strings, 181. 

Binary tensor contingency problem, 

142, 151. 

Binomial coefficients, 149. 

Binomial convolutions, 250. 

Bipartite graphs ， 58, 177, 290. 

Bipartite matching, 150- 
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Bipartite structure, 90. 

Birthday paradox, 49. 

Bishops, 141. 

Bitmaps ， 17 ， 139. 

Bitwise operations, 11, 12, 81, 158, 161, 
241, 246, 258-259. 

Black and blue principle, 146, 216. 

Black and white principle, 146. 

Blake, Archie, 130. 
blit, 234, 236. 

Block decomposition, 275. 

Block designs, 106- 

Block diagonal matrices, 177. 

Blocked clauses, 102, 215, 260, 261, 269- 
binary, 146- 
elimination of ， 167. 

Blocked self - subsumption, 167. 

Blocking digraph, 215. 

Blocks in Life ， 197, 200. 

Bloom, Burton Howard, coding, 258- 
Bloom, Thomas Frederick, 185. 

Bob ， 20-24, 115, 139-141. 

Bdhm, Max Joachim, 131. 

BolloMs, Bela, 54, 220. 

Bonacina, Maria Paola, 129. 
book graphs ， 126, 179. 

Boole ， George ， 129. 

Boolean chains, 9, 11, 12, 102, 114, 173- 
optimum, 178-179- 
Boolean formulas, 1- 
Boolean functions, 14-16. 
expressible in kCNF, 220. 
synthesis of, 178-179. 

Boppana, Ravi Babu, 174. 

Borgs, Christian, 54. 

Bottom-up algorithms, 252. 

Boufkhad, Yacine 8, 26, 131 ， 

135, 137, 143, 174/272. ** 

Boundary sets, 58, 154 ， 180 ， 188. 
Boundary variables, 230. 

Bounded model checking, 16—24, 132, 
137-141 ， 157, 179-180. 

Branching heuristics, 105, 144, see also 
Decision literals. 

Branching programs, 102, 173, 174. 
Branchless computation, 242* 

Braunstein, Alfredo, 90, 91, 256. 
Breadth-first search, 37, 43, 68, 130, 235. 
Break count, 79. 

Breaking symmetries, vii, 5, 19, 105—114, 
138, 176—181 ， 187, 188, 190—192, 238, 
267, 281-283, 285, 288-290. 
in graph coloring, 99-100, 114, 171, 

179, 187. 

Broadcasting, 170. 

Broad word computations, 11, 12, 158, 

161 ， 246, 258. 

Brown, Cynthia Ann Blocher, 30, 32, 

131 ， 151 ， 226. 


Brown, Thomas Craig, 185. 

Brummayer, Robert Daniel, 269. 

Brunetti, Sara, 206. 

Bryant, Randal Everitt, v, 7, 187. 

BST(D ， 211. 

BSTAMP counter, 211. 

Buckingham, David John, 197, 200. 

Buddy system, 36 ， 144, 235. 

Bugrara, Khaled Mohamed 

226. 

Bugs, 16, 69, 77, 133, 240. 

Bulnes-Rozas, Juan Bautista, 215- 
Bumped process, 21. 

Bundala, Daniel, 196- 
Burney, Charles, viii. 

Burns, James Edward, 204. 

Buro, Michael, 131- 

Buss, Samuel Rudolph, v, 153, 270. 

Bystanders, see Easy clauses. 

C-SAT solver ， 131- 
Cache memories, 24. 

Calabro, Chris, 288. 

Candidate variables, 40—44, 131, 214. 
Canonical forms, 138, 248. 

Cardinality constraints, 7-8, 26, 104, 106, 
113, 114, 121 ， 135, 143, 187, 188 ， 

193, 194, 196, 204, 285. 
for intervals, 100, 190, 280. 

Carlier, Jacques, 131. 

Carlitz, Leonard, 162. 

Carriers in Life ， 197, 200. 

Carroll, Lewis (= Dodgson, Charles 
Lutwidge), 129-130. 

Carry bits ， 9, 12, 101 ， 192, 193. 

Cartier, Pierre Emile, 83, 86 ， 163. 

Case analysis, 27, 130- 

CDCL (conflict driven clause learning) 

solvers ， 62—71 ， 103, 121 ， 132—133, 155. 
combined with lookahead solvers, 129. 
compared to lookahead solvers, 98—100, 
118—121 ， 182, 290. 

Cells of memory ， 28 ， 122-124* 

Cellular automata, 17, 202* 

Certifiable clauses, 168, 260. 

Certificates of unsatisfiability, 69—71, 

157, 169, 176, 178. 

Chaff solver ， 67 ， 132. 

Chain rule for conditional probability ， 254. 
Chains, see Boolean chains, Resolution 
chains, 5-chains. 

Channel assignment, 136. 

Channeling clauses, 264. 

Characteristic polynomial of a matrix, 

163, 218. 

Chavas, Joel, 91. 

Chayes, Jennifer Tour, 54. 

Chebyshev (= Tschebyscheff)，Pafnutii 
Lvovich (MeSBiineB'b, nacJ)HyTiH 
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JIi>BOBH x i r b = MeSBinieB, na<J)HyTHH 

JIbbobhm), inequality, 221* 
polynomials, 247. 

Cheshire Tom, 24—26, 115, 142-143- 
Chess ， 7 ， 170. 

Chessboards ， 18, 25, 99, 106, 115, 138, 180. 
Chiral symmetry (rotation but not 
reflection) ， 138, 202, 275- 
Chordal graphs, 163-164. 

Chromatic number x(^) 5 99, 135-136, 

147, 174, 281. 

Chung Graham, Fan Rong King 

(鍾金芳蓉 ) ， 283. 

Chvatal, Vaclav (= Vasek), 5, 52, 59^ 185- 
Cimatti, Alessandro, 132. 

Circuits, Boolean, 10, 101-103, 114, see 
also Boolean chains- 
Circular lists, 32. 

Clarke, Edmund Melson, Jr” 132. 

Clashing pairs of letters, 84. 

Clausal proofs, see Certificates of 
unsatisfiability. 

Clause activity scores, 74, 239. 
Clause-learning algorithms, 61—62, 103, 

118, 121 ， 132-133, 154-155. 

Clauses per literal ， 150, 231; see also 
Density of clauses. 

Claw graph, 249. 

Cliches, 76- 

Clique Local Lemma, 165- 
Cliques, 81, 100, 162, 167, 169, 171 ， 

179, 190- 
covering by, 165. 

Closest strings, 114, 181, 182. 

Clusters, 166- 

CNF: Conjunctive normal form, 9, 101, 

154, 173, 193, 196- 
Cocomparability graphs, 249, 250- 
Coe, Timothy Vance, 201. 

Coexisting armies of queens, 180- 
Cographs ， 163, 250- 
Cohen, Bram, 79, 246. 

Coja-Oghlan, Amin, 221. 

Colexicographic order, 206, 278. 

Coloring a graph, 6-7, 99-100, 153, 179, 260. 
fractional, 135- 
multiple, 135. 

of queens, 99-100, 114-115, 171- 
radio, 136- 
Column sums, 151. 

Commutative law ， 27, 180, 227. 

partial ， 83, 250-251. 

Comparator modules, 115, 137. 

Comparison, lexicographic, 101, 111-113. 
Comparison of running times, 34—35, 39, 

69, 97—100, 105—107, 110, 112, 118—128, 
182, 184, 218, 237, 264, 281 ， 290. 
Compensation resolvents, 39, 144, 147. 
Competitions, 131-133, 291. 


Complement of a graph, 134. 
Complementation of unary representations, 
100 . 

Complemented literals, 2-4, 37, 62—64, 

78, 111 ， 210, 266. 

Complete binary trees, 8, 135, 230. 

Complete bipartite graphs K m , n , 250, 254. 
Complete graphs K n , 153, 178 ， 186, 262. 
Complete /c-partite graphs, 250, 262. 
Complete t-ary trees, 160- 
Compressing, see Purging unhelpful clauses- 
Conditional autarkies, 215. 

Conditional expectation inequality, 150. 
Conditional symmetries, 107, see 
Endomorphisms- 

Conditioning operations (F1 1 and F \ L), 27, 
96, 143, 157, see Unit conditioning. 
Cones in trace theory, 87. 

Confidence level ( 岭 ) ， 93, 255- 
Conflict clauses, 63, 70, 171; see also 
Preclusion clauses- 

Conflict driven clause learning, 62—69, 

103, 121, 132—133, 155- 
Conflicts, 62, 124, 132. 

Conjunctive normal form, 1, 9, 101, 

154, 173, 193, 196- 
irredundant, 257. 

Conjunctive prime form, 104. 

Connected graphs, 177- 
Connectedness testing, 169-170. 

Connection puzzles, 170. 

CoNP-complete problems, 3, 207. 
Consecutive Is, 88, 175, 254. 

Consensus of implicants, 130. 

Consistent Boolean formulas, see 
Satisfiable formulas- 
Consistent partial assignments, 30, 165. 
Constrained variables in partial assignments 
165-166. 

Contests, 131-132. 

Context free languages, 175- 
Contiguous United States, 136. 

Contingency tables, binary, 142. 

3D ， 151. 

Convex functions, 216- 
Convex hulls, 247. 

Convolution principle, 250- 
Conway, John Horton, 17 ， 139, 201. 

Cook，Stephen Arthur ， 61 ， 62 ， 130-131 ， 

154, 229, 237. 
cook clauses, 157. 

Cooper, Alec Steven, 285- 
Core assignments, 166- 
Core of Horn clauses, 174, 216- 
Coupon collector’s test, 220- 
Covering assignments, 166, 221, 255. 
Covering problems, 2, 193, 194, see also 
Domino coverings. 

Covering strings, 181- 
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CPLEX system ， 26, 289. 

CPU: Central Processing Unit (one 
computer thread), 121. 

Crawford, James Melton ， Jr., 98, 113. 
Cray 2 computer, 137- 
Critical sections, 21—23, 140—141. 
Crossover point, see Threshold of 
satisfiability. 

Crusoe (= Kreutznaer), Robinson ， vii. 
CSP: The constraint satisfaction 


problem, 283. 

Cube and conquer method, 129- 
Cubic graphs (3-regular, trivalent), 

147, 154, 231- 
Cufflink pattern, 255- 
Culver, Clayton Lee, 185. 

Cut rule, 59- 

Cutoff parameters, 41, 145- 
Cutting planes ， 184, 206. 

Cycle detection problem, 260. 

Cycle graphs C n , 135, 160, 262. 

Cycle structure of a permutation, 108, 
112—113, 163, 178, 277- 
Cyclic DPLL algorithm, 33- 
Cyclic patterns, 19- 
Cyclic permutations, 163. 

da Vinci, Leonardo di ser Piero, 7. 
Dadda ， Luigi ， 9, 114, 136, 173- 
Dags: Directed acyclic graphs, 54. 

of resolutions, 54-56, 70- 
Damping factors ， 46, 67, 74, 76, 93—94 ， 
125, 126, 155- 

Dancing links ， 5, 121 ， 134, 208, 288, 291. 
Dantchev, Stefan Stoyanov (^aHMeB, 
CTe<J)aH Cto-hhob), 110. 


Darwiche, Adnan Youssef 

j 匕 jjp) ， 67 ， 262. 


Data structures, 28-34, 36-38, 43, 66—67, 
80, 95—96, 143—145, 155—156, 159 ， 

167, 238, 273- 

Davis，Martin David, 9, 31-32, 130, 298. 
Dawson, Thomas Rayner, 170- 
De Morgan, Augustus, laws, 3. 
de Vries, Sven, 206. 
de Wilde, Boris, 213- 
Deadlock, 22-23. 

Debugging, 69, 77. 

Dechter, Rina Kahana (lom m*n) ， 67- 
Decision literals, 62, 69, 124, 132. 

Decision trees, see Search trees. 
Decomposable matrices, 177. 

Default parameters, 93, 125-126. 

Default values of gates, 11. 

Definite Horn clauses, 174* 

Defoe，Daniel (= Daniel Foe), vii. 

Degree of a vertex, 191. 

Degrees of truth, 37—39, 42—43, 45—46, 216. 
Dekker, Theodorus Jozef, 140. 
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Del Lungo, Alberto, 206- 

Delayer, 55-56, 152-153- 

Deletion from a heap, 234. 

Delta sequence, 290. 

Demenkov, Evgeny Alexandrovich 

(^HeMeHKOB, EBreHHH AjieKcaH^npoBm )， 
280. 

Density of clauses: The number of clauses 
per variable, 50-51, 150, 231, 288. 

Dependence graph in trace theory, 248. 

Dependence of literals, 63. 

Dependency digraph (of literals), 41, 131, 
168, 215, 237, 260. 

Dependency-directed backtracking, see 
Backjumping. 

Dependency graph (of events), 82, 164, 165. 

Dependency on a variable, 137. 

Depth-first search, 130. 

Dequen, Gilles Maurice Marceau, 131. 

Determinants, 162, 163, 251. 

Deterministic algorithm, 17, 120. 

Deventer, Mattijs Oskar van, 290. 

DFAIL ， 46, 147. 

Dfalse literals, 45- 

Diagonals of a matrix, 24-25, 141-142. 

Diagram of a trace, 84. 

Diaz Cort, Jose Maria (= Josep) ， 51- 

Dick, William Brisbane, 180. 

Difficult instances of SAT, 5, 14, 26, 51, 
55—59, 118—121 ， 153—154, 184, 190 ， 

192, 197, 206, 280. 

Digital tomography, 24—26, 115, 141—143 ， 

167, 285. 

Digraphs ， 54, 108, 161 ， 162, 263, see also 
Blocking digraph, Dependency digraph, 
Implication digraph. 

Dijkstra，Edsger Wybe ， 22, 202, 204. 

DIMACS: Center for Discrete Mathematics 
and Theoretical Computer Science, 131. 

DIMA CS : DIMA CS Series in Discrete 

Mathematics and Theoretical Computer 
Science, inaugurated in 1990- 

Ding, Jian ( 丁剑 ) ， 51. 

Direct encoding, 98, 114, 171 ， 186, 

264, 265, 281. 

Direct sum of graphs or matrices, 162, 177. 

Directed acyclic graphs of resolutions, 

54- 56, 70. 

Directed graphs, see Digraphs. 

Discarding the previous learned clause, 

72, 156- 

Discrepancy patterns, 114, 182. 

Disjoint shortest paths, 276. 

Disjunctive normal forms, 14, 115, 

130, 195, 257- 

Distance d(u^v) in a graph, 262. 

Distinct literals, 2. 

Division of traces, 85, 161, 250. 
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DNF: Disjunctive normal form, 14, 115, 

130, 195, 257- 

Dodgson, Charles Lutwidge, 129-130. 
Domino coverings, 110, 114, 115, 143, 

177, 178. 

Don’t-cares, 194, 280- 

Double clique hints, 100, 114, 171. 

Double coloring, 115, 135. 

Double lookahead, 45-46, 126, 131, 286. 
Double order, 214. 

Double truth, 45. 

Doubly linked lists ， 28, 257, 259- 
Downhill resolution, 96, 166. 

Downhill transformations, 95. 

Doyle, Arthur Ignatius Conan, 72. 

DPLL (Davis, Putnam, Logemann, 
Loveland) algorithm, 32-33, 62. 
with lookahead, 38, 131. 

DT (double truth), 45- 
Dtrue literals, 45. 

Dual of a Boolean function, 130, 174. 

Dubois, Olivier, 131- 

Dudeney, Henry Ernest, 114, 263. 

Dufour, Mark, 37- 
Dull, Brutus Cyclops, 181. 

Durfee, William Pitt ， square, 276. 

Dynamic storage allocation, 144. 

Dynamical system, discrete, 16- 

e, as source of “random” data, 12, 193. 
Eager data structures, 30, 36, 156- 
Easy clauses, 149. 

Eaters in Life ， 20, 139. 

Een, Niklas G6ran, v ， 67, 96, 166, 

203, 260, 268. 

Ehlers, 士 horsten, 196. 

Eightfold symmetry, 138, 198. 

Elegance, 35-36, 196- 
Elimination of clauses, 167-168; see also 
Purging unhelpful clauses. 

Elimination of variables, 60 - 61, 95-97, 101, 
102, 129, 130, 154—155, 166—168, 173 ， 
174, 256-257, 259-260, 270, 272. 
Embedded graphs, 169, 262. 

Empilements, 84, 161, 248. 

Empirical performance measurements, 
122-124. 

Empty clause (e), 3, 27, 185. 

Empty list, representation of, 33, 210- 
Empty partial assignment, 166. 

Empty set (0) ， 185- 
Empty string (e), 3, 85. 

Encoding into clauses, 6, 18, 97-105, 120, 
134, 170, 179, 198, 202. 
ternary data, 100, 141, 179. 
Endomorphisms, 107—111, 177—178, 

181, 281, 290- 

Equal sums, encoding of, 174* 

Equally spaced Is, 4, 114, 135; see 
also waerden. 


Equivalence classes in trace theory ， 84. 
Equivalence of Boolean functions, 178. 
Erdos, Pal (= Paul) ， 81 ， 107, 190, 281. 

discrepancy patterns, 114, 179, 182. 

Erp rules ， 95-96, 166—168, 259. 

Evaluation of Boolean functions, 137, 
178-179, 194. 

Even-length cycles, 277. 

Even-odd endomorphisms, 110, 177-178. 
Exact cover problems, vii, 2, 5—6, 28, 134, 
183, 186, 219, 225, 257, 291. 
by pairs (perfect matchings), 109—110, 
see also Domino coverings, 
by triples (3D MATCHING), 134 ， 

225, 290-291. 
fractional, 135-136- 
Exclusion clauses, 6, 21, 99, 114, 134, 

149, 153, 238, 260, 289. 

Exclusive or, ternary, 136. 

Existential quantifiers, 60- 
Expander graphs, 58, 231. 

Exploitation stack, 259- 
Exploration phase of lookahead, 40, 43-44. 
Exponential time, 144. 
hypothesis, 288. 

Extended resolution, 60, 71, 133, 154, 

168, 215. 

Extreme distribution, 87, 89, 163. 

factor 一 10, V2, 114, 184, 192, 
factorJifo(m ， n ， z), 10, 114, 184, 192. 
factor-rand (m，n ， z ， s) ， 10 ， 184. 
Factorization, 8-10, 136, 184, 192. 

of traces, 86 ， 162, 250. 

Failed literals ， 97, 167, 175, 269. 

Fallacious reasoning, 16, 284* 

False hits, 258- 

False literals preferred, 31, 33, 67, 

125-127, 286. 

Fanout gates, 10—14, 136. 

Fat clauses, 58. 

Fault testing ， 10-14, 114, 126, 136—137 ， 
167, 260. 

Feedback mechanism, 46, 104. 

Fermat, Pierre de, 10. 

Fernandez de la Vega, Wenceslas, 52. 
Fibonacci, Leonardo, of Pisa (= Leonardo 
filio Bonacii Pisano), numbers, 

160, 215, 254. 
ruler function, 246. 

Fichte, Johannes Klaus, 262. 

Field of a variable, 91, 165. 

FIFO: first in, first out, 10. 

Finite-state automata, 175. 

First in, first out, 10. 

First moment principle, 53, 148, 150. 

First order logic, 59, 130- 

Fischetti, Matteo, 206- 

Fixed point of endomorphisms, 177- 
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Fixed values of literals, 37, 42-46. 

FKG inequality, 89. 

Flag bits, 235. 

Flammenkamp, Achim, 198- 
Flexibility coefficients, 91. 

Flickering state variables, 141. 

Flipflops in Life ， 138 ， 143. 

Floating point arithmetic, 91—92, 217, 239. 
overflow, 67- 

Floor tiling, 115, 143 ， 199. 

Flower snarks, 69, 147, 153, "157- 
Flushing literals and restarting, 68, 75-77, 
124, 132, 157, 158, 169, 234, 246. 
Foata, Dominique Cyprien, 83, 86, 163. 
Focus of attention, 41 ， 67, 91 ， 132. 

Foe, Daniel (= Daniel Defoe), vii- 
Footprint heuristic, 279. 

Forced literals, 45- 

Forcing clause, 62, see Unit propagation. 
Forcing representations, 104—105, 174, 

175, 274. 

Forests ， 43, 87, 163. 

Forgetting clauses, 168, 169, see Purging 
unhelpful clauses- 
Four bit protocol, 115. 
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Frances, Moti ， T)D), 285- 
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modified ， 114, 280. 
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Gadgets, 134 ， 183. 
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Game of Life ， 17— 20, 97, 114, 137—139 ， 

143, 167. 
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Gates ， 10—13, 101—103, 121 ， 136. 
GB_GATES, 13-14. 

Gebauer, Heidi Maria, 224. 

Geek art, 116-117- 
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Global ordering, 284. 
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Goultiaeva, Alexandra Borisovna 

(ryjibT>ieBa 5 AjieKcaH^pa BopncoBHa), 
73. 

Grabarchuk, Peter Serhiyevich (rpa6apnyK, 
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Graph quenching, 114 ， 179-180, 281. 
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Greedy algorithms ， 80 ， 136 ， 172. 
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Grid graphs, 110, 136, 151 ， 162-163. 
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Grid patterns, 17—20, 24—26, 137—139, 142. 
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Halevi，Shai (An289. 

Half adders ， 9, 192, 268. 
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paths, 184. 
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Han，Hyojung ( 赶直省）， 236- 
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Hanzelet, see Appier dit Hanzelet- 
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Header elements, 225- 

HEAP array, 67, 158, 233-235, 240. 

Heap data structure, 67, 214. 
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Hint clauses, 100, 114, 171. 
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Independent vertices, 7, 147. 
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125—127, 286. 

Initial state X 0 , 16-17, 21 ， 24, 140, 202. 
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Internet, ii, iii ， v, 118. 
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Irredundant CNF ， 257. 
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ISTACK array ， 38, 145. 
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Jacquet, Philippe Pierre, 225. 

Jagger, Michael Philip “Mick” ， 1. 

Janson, Carl Svante, v- 

Jarvisalo, Matti Juhani, 105, 132, 260, 261. 
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Jermm, Mark Richard, 151. 

Job shop problems, 172. 

Johnson, David Stifler, 184, 191. 
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rscopyiou), 51- 
Lamp or t, Leslie B., 24, 204. 
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Level 0, 62, 66 , 124, 156, 207, 233. 
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Levine, Eugene, 275. 

Lewis, Jerome Luther, 275. 
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element, 111, 283. 

Lexicographic order, 4, 25, 26, 30, 101, 105, 
107, 109, 111—113, 115, 197, 282-283. 
encoded in clauses, 101, 173, 174. 
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106—107, 177, 181 ， 274. 
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157, 282, 283. 
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161, 162, 250. 

Leyton - Brown, Kevin Eric, 125, 133. 
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Li，Wei ( 李未 ) ， 149. 
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Life, Game of ， 17— 20, 97, 114, 137—139 ， 

143, 167- 
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Matrix multiplication, 260- 
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Maximum independent sets, 87, 136, 
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135, 136, 177. 
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“Maybe” state ， 20. 

Mazurkiewicz, Antoni Wiesiaw, 83. 

McColl (= MacColl), Hugh ， 227. 
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graphs ， 7-8, 114-115, 134, 135, 188- 
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Median operation, 9, 136, 179- 
Median running times, 99, 120 - 124, 127. 
Megamem (M/i): One million memory 
accesses, 201. 

Mejean, Henri-Michel, 226- 
Mellin, Robert Hjalmar, transforms, 151- 
Mem (/x): One 64-bit memory access, 34. 
MEM array ， 66, 68, 124, 156- 
Memo cache, 60, 233. 

Memoization technique, 233. 

Memoryless property, 244. 

Menagerie, 116-117- 
Merge networks, 266. 

Merging lists, 231, 258- 
Mertens, Stephan, 51- 
Message passing, 90-95, 165-166. 

Method I, 61. 

Method IA, 61, 154. 
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Methuselah solitaire, 282. 

Methuselahs in Life, 19- 

Mezard, Marc Jean Marcel ， 51 ， 90, 91 ， 95. 
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Mills, Burton Everett, "130- 
Minesweeper, 142-143. 
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Minterms, 179. 

Mitchell, David Geoffrey, 50. 

Miters, 121, 182. 

Mitsche, Dieter Wilhelm, 51. 

Mixed metaphors, 76. 

Mixed-radix number systems, 268. 

MMIX computer, ii, 158- 
Mobile Life paths, 18-19, 138-139. 
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Mdbius, Augustus Ferdinand, functions, 86. 

series ， 86, 160, 162-163, 165, 247, 249. 
Mod 3 addition, 114, 179. 

Mod 3 parity, 179. 

Mod 4 parity, 179. 

Model checking, 16—17, 137—141 ， 179-180- 
Model RB ， 149. 

Modified full adders, 114, 280. 
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mone ( — 1) ， 242* 

Monien, Burkhard, 215- 
Monkey wrench principle, 113, 181. 
Monotone functions, 163- 
Boolean, 137, 281- 
Monotonic clauses, 5, 133, 157. 

Monotonic paths, 108, 276. 

Montanari, Andrea, 95. 
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92, 247, 268. 
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Mott-Smith, Geoffrey Arthur, 282. 

Move codes, 29-31 ， 34, 144, 145, 155, 210. 
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Mueller, Rolf Karl, 60, 130- 
Mliller ， Mike, 196. 

Multicommodity flows, 170. 
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Multilinear function, 86. 

Multiplication of binary numbers, 8—9, 

12-14, 114, 136, 173. 

Multiplication of traces, 85, 161. 

Multisets, 3, 214, 224, 250. 

Multivalued graph colorings, 99- 
Mutilated chessboard, 110, 114, 

177—178, 286- 

Mutual exclusion protocols, 20 - 24, 

115, 139-141. 

Mutzbauer, Otto, 275- 

Mux operation (u? v: 81, 102, 

152, 173, 219. 

Mysterians, 290- 

n-cube, 79, 136, 148, 184. 

n.f.: not falsified, 271. 

n queens problem, 25, 115, 171, 282. 

NAND operation, 60. 

Napier, John, Laird of Merchiston, 9, 173- 
Near truth, 37-39. 

Necessary assignments, 45, 146. 

Negated auxiliary variables, 105. 

Negative /c-clauses ， 157. 
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Niemela, Ilkka Niilo Fredrik, 105. 
Nieuwenhuis, Robert Lukas Mario, 267. 
Nightingale, Peter William, 265- 
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Nodes of a search tree, 34—35, 69, 124. 

Noels ， Alain ， 202. 

Noisy data, 181. 

Nonattacking queens, 25, 115, 171, 282. 
Nonaveraging sets, 114, 135- 
Nonchromatic rectangles, 176-177. 
Nonchronological backtracking, see 
Backjumping. 

Noncommutative variables, 162* 
Nonconstructive proofs, 57, 58, 81, 202. 
Nondeterministic finite-state automata, 175. 
Nondeterministic polynomial time, 131. 
Nondeterministic processes, 20, 141, 182. 
Nonintersecting paths, 170. 
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Nonnegative coefficients, 164. 

Nonprimary columns, 186. 

Nonterminal symbols, 175- 
Normal chains, 278. 

Normal functions, 279- 
Not-all-equal SAT, 185. 
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dS (boundary set), 58 ， 154, 180 ， 188. 

C 1 o C" (resolvent), 54, 152. 

C C C r (subsumption) ， 61, 152. 

Fl(F given l), 27, 96, 291. 

F L (F given L) ， 27, 103, 157. 

Fh C (F implies C) ， 59, 152, 153- 
F hi e, 70, 157, 175. 

F hi l, 103—104, 176- 

F 卜 fc e ， 

F \~ k l, 175-176. 

G ® H (direct sum), 162, 177- 
/| (a literal’s variable), 2. 

士 r (i ； or v)^ 2. 

(xyz) (median), 9- 

x Szy (bitwise AND), 28, 29, 31, 37, 38, 66, 
68, 76, 81 ， 196, 209—211 ， 220, 241. 

^ I y (bitwise OR) ， 43, 196, 241 ， 258-259. 
x ㊉ y (bitwise XOR), 28, 137, 196, 

208, 220, 241. 

x — y (monus), 92, 247, 268. 
x? y: z (if-then-else), 81, 102, 152, 

173, 219. 
w(a)^ 57. 
w(a h e), 57. 

||ah C\\, 57- 
/i(C) ， 59, 153. 

Novikov, Yakov Andreevich (Hobhkob ， 

-Hkob Ah 即 eeBMH), 70, 132* 

Nowakowski, Richard Joseph, 107, 275- 
NP-complete problems, 1 ， 3, 27, 87, 

130-131 ， 134, 142, 151 ， 181—183, 207 ， 
268, see also CoNP-complete problems. 
NT (near truth), 37-39- 
Null clause (e) ， 3, 27, 185, 291- 
Null list, representation of, 33, 210- 
Null partial assignment, 166- 
Null set (0), 185- 
Null string (e) ， 85- 
Nullary clause (e) ， 3, 27 ， 185, 291. 

Number theory, 14, 137 ， 192. 

Occurrence threshold of a graph, 160. 

Odd permutations, 218. 

Odd-even merge network, 266. 

Odd-even transposition sort, 263. 

Oliveras i Llunell, Albert, 267. 

On-the-fly subsumptions, 124, 156. 
One-in-three satisfiability, 183. 
One-per-clause satisfiability, 183- 
Open shop scheduling problems, 115, 
172-173- 

OR operation, 9, 10, 13, 258. 

bitwise O I y) ， 43, 196, 241 ， 258-259- 


Orbits of a permutation group, 108, 277- 
Order encoding, 98—101 ， 114, 120, 170—173, 
190, 268, 281. 

Order of a permutation, 111- 
Organ-pipe permutations, 171- 
Oriented cycle detection, 260. 

Oriented trees, 108. 

Orphan patterns in Life, 139. 

Orponen, Olli Pekka, 80. 

Oscillators in Life, 19, 138-139. 

Output states, 175. 

OVAL array ， 74, 125, 237, 240. 

Overfitting, 182. 

Overflow in arithmetic, 67, 240- 
Oxusoff, Laurent, 215. 

p (tautology, the always-true clause), 3, 58, 
60, 152, 180, 215, 226—228, 258. 

P = NP ， 1. 

Palindromes, 136- 

Panagiotou, Konstantinos (navayicoTou, 

KovaTavTivo(；) 5 221. 

Papadimitriou, Christos Harilaos 

(IIa7ia8rj{xr]Tptou 5 XpiaTO (； XaptXaou), 

77, 240, 241. 

Parallel multiplication circuits, 12—14, 137. 
Parallel processes, 20, 24, 121, 128-129. 
Parameters, tuning of, 80—81, 93—94, 
124-128. 

ParamILS, 125, 286-287. 

Parity-related clauses, 153—154, 172, 

178, 231-232, 290. 

Partial assignments, 30, 61, 62, 165, 176. 
Partial backtracking, 208. 

Partial latin square construction, 151- 
Partial orderings, 56, 85, 115, 248. 

of dimension < 2, 213. 

Participants, 41, 44, 145- 
Path detection, 169. 

Path graphs _P n , 84, 160, 253. 

Patience, see Solitaire games. 

Paturi, Ramamohan Jar6), 288. 

Paul, Jerome Larson, 5. 

Pauli, Marvin Cohen, 148. 

PC k , 176, 178. 

Pearl，Judea ( 力 £5 nnn ，）， 95. 

Pegden，Wesley Alden, v ， 164, 253. 

Peierls, Rudolf Erast, 95. 

Peres，Yuval (tH3 221. 

Perez Gimenez, Xavier, 51. 

Perfect matchings in a graph, 109—110, 177- 
Permanent of a matrix, 183, 251. 
Permutation polynomial of a set, 163. 
Permutation posets, 213- 
Permutations, 105, 265- 

signed, see Signed permutations, 
weighted, 163. 

Permuting variables and/or complementing 
them, see Signed permutations. 
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Peterson, Gary Lynn, 23, 115, 140, 204. 
Petrie, Karen Elizabeth Jefferson, 283- 
Phase saving, 67, 75. 

Phase transitions, 50-52, 149-150. 

Phi (0) ， 146, 147, 160, 251- 

Phoenix in Life, 198, 207- 

Pi ( 丌 ), as source of “random” data, 12, 

46, 108, 115, 147, 184, 193, 286; 
see also Pi function. 

Pi function ， 102 ， 174. 

Pieces, in trace theory, 84-87. 

Pigeonhole principle, 57- 

clauses for, 57—59, 105—106, 113, 153, 

176, 181 ， 186, 265- 

Pikhurko, Oleg Bohdan (IlixypKO, Ojier 
Bor 职 hobhh) , 285- 
Pile sums, 151. 

Pincusians, 133. 

Pipatsrisawat, Thammanit (= Knot) 

(#— ⑽初今滴 ， (= urn)), 67, 262. 
Pixel images, 200; see also Grid patterns. 
Plaisted，David Alan, 102. 

Planning, 132. 

Playing cards, 114 ， 180, 282. 

Points, abstracted, 106. 

Poison cards, 282. 

Poisson, Simeon Denis, probability, 225. 
Polarities ， 3, 67, 76, 207, 237. 

Polya, Gyorgy (= George), theorem, 284. 
Polynomials in trace theory, 85. 

Population in Life, 19. 

Portfolio solvers, 133. 

Posets, see Partial orderings- 
Positive autarkies, 146- 
Positive j-clauses, 157. 

Positive literals, 2, 132, 146. 

Posthoff, Christian, 275. 

Postorder, 42-43, 214. 

Postprocessor, 96. 

Preclusion clauses, 99, 171, 186. 

Preorder, 42—43, 214. 

Preprocessing of clauses, 95-97, 103, 
166—168, 182, 268, 272, 278. 

Preselection phase of lookahead, 40—42, 147. 
Prestwich, Steven David ， 264. 

Primary variables, 104, 105. 

Prime clauses, 174, 270, 273- 
Prime implicants, 281- 
Pringsheim, Alfred Israel, 88, 164. 

Prins, Jan Fokko, 267. 

Probabilistic method, 81- 
Probability of satisfiability, 47-54. 
prod(m ， n), 12—14 ， 114, 137. 

Production rules, 175- 
Profile of a search tree, 151. 

Progress, display of, 30, 145, 155. 

Progress saving, 67, see Phase saving. 
Projection of a path, 184. 

Projective plane, 274. 


Propagation, /cth order, 175—176, 273- 
Propagation completeness (UCi), 176- 
Proper ancestors, 164. 

Proto truth, 37, 42. 

Prover—Delayer game, 55—56, 152-153. 
PSATO solver ， 159- 

Pseudo-Boolean constraints, see Threshold 
functions- 

PT (proto truth) ， 37, 42. 

Pudlak, Pavel, 55. 

Puget, Jean-Frangois, 113- 
Purdom, Paul Walton ， Jr” 30, 32, 

131 ， 151 ， 226. 

Pure cycles, 140. 

Pure literals, 29, 31, 32, 34, 44, 60, 130, 

135, 146, 152, 208, 215, 227, 256, 

259, 268, 269, 275. 

Purging unhelpful clauses, 68, 71-75, 124, 
132, 157, 158, 168, 182, 184, 235. 
threshold for, 74, 125, 127- 
Putnam, Hilary, 9, 32 ， 130, 298. 

Pyramids in trace theory, 87, 162. 

q.s -： quite surely, 149, 153, 169. 

QDD: A quasi-BDD ， 188. 

Quad-free matrices, 106—107, 113, 

176—177, 274, 284. 

Quantified formulas, 60, 154. 

Queen graphs, 25, 99-100, 114—115, 

120, 171 ， 180, 181. 

Quenchable graphs, 179-180, 281- 
Quick, Jonathan Horatio, 181- 
Quilt patterns, 198. 

Quimper, Claude-Guy, 272. 

Quine, Willard Van Orman, 129, 130. 

7Z(G) (Local Lemma bounds), 82, 87—90, 

160, 163-165- 
Radio colorings, 136. 

Radix-d representation, 173. 

Rado, Richard ， 191. 

Ramakrishnan, Kajamalai Gopalaswamy, 16. 

ram an graphs, 231- 

Ramani，Arathi (邊 斤威發 112, 

281, 284. 

Ramanujan Iyengar, Srinivasa surmrv 

rrrrLnu^)i^(bw 漂 graphs, 154; 
see also ram an graphs. 

Ramos, Antonio, 75- 

Ramsey, Frank Plumpton, theorem, 81 • 

rand, 39-40, 46, 50, 115, 147, 182. 

Random bits ， biased, 12, 241. 

Random choices, 12. 

Random decision variables, 125—127, 

155, 286. 

Random graphs, 81. 

Random permutations, 233. 
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Random satisfiability problems, 47—54, 

91 ， 151. 

2SAT, 51—54, 149. 

3SAT, 39—40, 46—51 ， 59- 60, 80, 93—94 ， 
147—149, 153, 242. 

/cSAT, 49-51, 146, 148. 

Random walks, 77-81, 125, 243- 
Random words, 149. 

Randomized methods, 77, 129, 182, 210- 
RANGE scores, 74, 125—127, 158, 239. 

RAT, see Resolution certifiable clauses. 
Rauzy，Antoine Bertrand, 131, 215. 
Reachability in a graph, 169. 

Ready list, 32. 

Real roots of polynomials, 163, 249- 
Real truth, 37-39. 

Reasons, 62, 72, 157, 165, 233. 

Rebooting, 22. 

Reckhow, Robert Allen, 61. 

Recurrence relations ， 151 ， 177, 189, 215, 243. 
Recursive procedures, 27 ， 130 ， 172 ， 186, 233- 
Recycling of clauses, 66, 124. 

Reduction of clauses, 27, 143; see also 
Simplification of clauses. 

Redundant clauses, 257. 

Redundant literals, 65, 155-156, 232, 234. 
Redundant representations, 171. 

Reed, Bruce Alan, 52. 

Reflected ternary code, 290. 

Reflection symmetries, 112, 138, 156. 
Refutation chains, 57, 227. 

Refutation trees, 152* 

Refutations, 54—60, 70, 110 ， 152; see also 
Certificates of unsatisfiability. 

Regular expressions, 174-175. 

Regular resolution, 55, 152, 231- 
Reinforcement messages, 91-93. 

Reliability polynomials, 83. 

Reluctant doubling, ?T, 80-81, 159-160. 
Reluctant Fibonacci sequence, 160- 
Renamed Horn clauses, 176, 263. 

Repeated clauses, 49- 
Replacement principle, 96- 
Representation of Boolean functions, 104, 
see Encoding into clauses- 
Representing three states with two bits, 179. 
Rescaled activity scores, 67. 

Resende, see Guilherme De Carvalho 
Resende- 

Resizing of data structures, 210. 

Resolution certifiable clauses, 261- 
Resolution chains, 57-59, 152, 153, 227- 
Resolution of clauses, 54—65, 70, 101, 129, 
130, 144, 167, 185, 215, 224, 256- 
implementation of, 167- 
Resolution refutations, 54-60, 70, 110, 152; 
see also Certificates of unsatisfiability, 
extended ， 60, 71 ， 133, 154, 168, 215. 
regular, 55, 152, 231. 


treelike, 55—56, 152—153. 

Resolvable clauses, 164. 

Resolvent {C r o C") ， 54, 130, 152. 
Restarting, 80-81, 95, 125, 132. 

and flushing literals ， 68, 75-77, 124 ， 132, 
157, 158, 169, 234, 246. 

Restricted growth strings, 179. 

Restricted pigeonhole principle, 58. 

Reusing the trail, 75. 

Reverse unit propagation, 71. 

Revolving door Gray code, 282. 

Reynaud ， Gerard, 226. 

Richards, Keith, L 
Rickard, John, 290. 

Right division of traces, 85, 161. 

Right factor of a trace, 161. 

Riis ， S 0 ren M 0 ller ， 110. 

Ripoff, Robert Iosifovich (Phitob, Po6epT 
Hoch<J)Obhh) 5 7- 

Rivest，Ronald Linn, clauses, 4, 55, 

70, 134, 144, 182. 

Roberts, Fred Stephen, 136. 

Robertson, Aaron Jon, 185. 

Robinson, Gilbert de Beauregard, 275. 
Robinson, John Alan, 59, 96, 227- 
Rodriguez Carbonell, Enric ， 267- 
Rokicki, Tomas Gerhard, 200. 

Rooij, Iris van, 207. 

Rook paths, 206- 

Rookwise connected cells, 170- 

Ross, Kenneth Andrew, 282. 

Rotational symmetry ， 138, 202, 275. 

Rotors in Life, 138. 

Roussel, Olivier Michel Joseph ， 132, 272. 
Routing, disjoint, 170. 

Row sums, 151 • 

Roy, Amitabha (rfro 陌讯 ^), 113. 

RT (real truth), 37-39, 43. 

Ruler doubling, 160. 

Ruler of Fibonaccis, 246. 

Running times, 89-90. 

comparison of, 34-35, 39, 69, 97-100, 
105—107, 110, 112, 118-128, 182, 184, 
218, 237, 264, 281 ， 290. 
mean versus median, 120. 
worst case, 144, 146, 154. 

Runs of ls ， 26, 143, 175. 

5-chains, 52-53, 149- 
5-snares, 53, 149. 


50-54. 

& ， n ， 49—51 ， 148, 149. 

S 〈 • • 5 Xyi ) and {x \ ， • • • ， 3^) ， 8 

see Cardinality constraints. 

Saddle point method, 226. 

Sahni，Sartaj Kumar (fKdM 宅 HI t 

268- 

Sais, Lakhdar (® • 彡 © "•XE.O, 

236, 289 - ~ 
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Sakallah, Karem Ahmad (^Jil^Lu j^i 
fJ K) ， 112, 132, 281， 284 

Salhi，Yakoub ^289. 

Sampling with and without replacement, 
49—50, 132, 226. 

Samson, Edward Walter, 60, 130. 

SAT: The satisfiability problem, 3. 

SAT solvers, 1, 131-133. 

SATexamples•tgz ， 118. 

Satisfiable formulas, 1. 

variability in performance, 35, 120—121, 
128, 287. 

Satisfiability, 1-184. 

history ， 32, 59—60, 105, 129-133. 
thresholds for ， 50— 54, 91 ， 148—149, 221. 
Satisfiability-preserving transformations, 
107-113. 

Satisfying assignments, 1, 30, 143—144, 

166, 214, 219. 

SATzilla solver, 132-133. 

Schaefer, Thomas Jerome, 289. 

Schensted, Craige Eugene (= Ea Ea), 275. 
Schlipf, John Stewart, 274. 

Schmitt, John Roger, 285. 

Schoenfield, Jon Ellis, 192. 

Schoning, Uwe, 78. 

Schrag, Robert Carl, 132. 

Schroeppel, Richard Crabtree, 197. 
Schwarzkopf, Bernd, 282. 

Scott, Alexander David, 224, 251 ， 252. 
Scott, Allan Edward Jolicoeur, 207- 
Scott, Sidney Harbron, 191- 
Scoville, Richard Arthur ， 162. 

Search trees, 28—29, 32—34, 124, 152. 
expected size, 151—152. 
optimum, 144. 

Second moment principle, 54, 221, 222* 
Seitz, Simo Sakari ， 80. 

Self-subsumption, 96, 167, 168, 257. 
Selman ， Bart, 50, 79, 132. 

Semimodular lattices, 255-256- 
Sentinel values, 259. 

Sequential consistency, 24. 

Sequential lists ， 36—37 ， 144. 

Sequents, 59. 

Serial correlation coefficients, 143. 

Set partitions, 220- 

SGB, see Stanford Graph Base. 

Shadows of paths, 184. 

Shandy, Tristram, iii. 

Sharp thresholds, 51—52, "149- 
Shearer, James Bergheim, 82, 87, 160. 
Sheeran^ Mary, 203. 

Shlyakhter, Ilya Alexander (IIIjiiixTep, 

AjieKcaH^poBH^), 284. 

Shmoys, David Bernard, 267. 

Shortest paths, 262. 

Shortz, William Frederic, v- 


SIAM: The Society for Industrial and 
Applied Mathematics, 204. 

Sideways sum {ux): Sum of binary digits, 
114, 143, 179, 195, 279. 
second order (z/ 2 ):r), 143. 

Sifting ， 219, 220. 

Siftup in a heap, 234. 

Signature of a clause, 72, 158. 

Signature of a literal ， 258. 

Signed mappings, 180-181. 

Signed permutations, 4, 111, 178. 

involutions, 112-113, 180, 277-278. 

Silva, see Marques da Silva. 

Silver, Stephen Andrew, 138, 200. 
Simmons, Gustavus James, 192. 

Simon, Laurent Dominique, 72, 132. 

Simple cycles and paths, 23—24, 140. 
simplex graphs, 136- 

Simplification of clauses, 65, 155, 232; see 
also Preprocessing of clauses. 

Sims, Charles Coffin, tables, 283. 
Simultaneous read/write, 141. 
Simultaneous write/write, 141* 

Sinclair, Alistair, 80, 159, 256. 

Singh, Satnam, 203- 

Single lookahead unit resolution, 105, 176- 
Single-stuck-at faults, 10—14, 114, 136-137. 
Sink: A vertex with no successor, 87, 214. 

components, 108-110. 

Sinz, Carsten Michael, v, 8, 117, 118, 

135, 174, 189, 280. 

Skip Two solitaire, 282. 

Slack, in trace theory, 88, 251. 

Slisenko (= Slissenko), Anatol Olesievitch 
(CjiHceHKO, AHaTOJiB OjiecbeBHH), 59. 
SLS: Stochastic local search, 77- 
SLUR algorithm, 105, 176. 

Sly, Allan Murray, 51. 

Smile ， 207. 

Smith, Barbara Mary, 283- 
Snake dance, 138. 

Snakes ， 52—54, 149- 
Snares, 52—54, 149. 

Snark graphs, 69, 147, 153, 157. 

Snevily，Hunter Saint Clair ， 5. 

Socrates, son of Sophroniscus of 

Alopece (ScoxpaTrj (； Scocppooviaxou 
'AXcoTisxfjGsv), 129- 
Soft clauses, 168- 
Sokal，Alan David ， 251, 252. 

Solitaire games, 180, 282. 

Solutions, number of, 48, 219. 

Somenzi, Fabio, 236. 

Sorensson, Niklas Kristofer, v, 67, 

155, 203, 268. 

Sorting networks, 115, 137, 203, 263, 266. 
Source: A vertex with no predecessor, 

87, 252. 

Spaceships in Life, 139, 201- 
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Spanning trees, 281, 290- 

Sparse encoding, see Direct encoding. 

Speckenmeyer, Ewald, 131, 215. 

Spence, Ivor Thomas Arthur, 290. 

Spencer，Joel Harold ， 81 ， 82, 254. 

Spiral order, 206. 

Stable Life configurations, 19, 197. 

Stable partial assignments, 165-166. 

Stacks, 37—39, 43. 

Stacking the pieces, 84-85. 

Stalmarck, Gunnar Martin Natanael, 56, 
132, 153, 203, 232, 238. 
Stamm-Wilbrandt, Hermann, 131. 

STAMPCD, 258- 

Stamping of data, 37— 38 ， 64, 66, 145, 

155, 211 ， 236, 258-260. 

Standard deviation, 48, 240. 

Stanford GraphBase, ii, 12, 13, 126 ， 

179, 214, 231. 

Stanford University, 282. 

Stanley, Richard Peter, 275. 

Starfish graphs, 249. 

Starvation, 22-24, 115, 140, 141. 

Statistical mechanics, 90. 

Stators in Life, 138. 

Stege ， Ulrike, 207. 

Stein, Clifford Seth, 267. 

Steinbach, Heinz Bernd, 275- 
Steiner, Jacob, tree packing, 264. 

triple systems, 106, 274. 

Sterne, Laurence, iii. 

Stickel，Mark Edward ， 132. 

Sticking values, 67, see Phase saving. 

Still Life ， 19, 138, 200. 

Stirling, James, approximation, 221, 240. 

subset numbers, 149, 220, 226. 

Stochastic local search, 77. 

Stopping time, 48—50, 148. 

Strahler, Arthur Newell, numbers, 152. 
Strengthening a clause, 96, 156, 259-260- 
Stribrna, Jitka, 224. 

Strichman, Ofer 203- 

Strictly distinct literals, 2-3, 52, 165. 
Strings generalized to traces, 83. 

Strong components: Strongly connected 
components, 41-42, 52-53, 108, 

131 ， 215, 221 ， 263- 

Strong exponential time hypothesis, 183. 
Strong product of graphs, 134. 

Strongly balanced sequences, 179- 
Stuck-at faults ， single ， 10-14 ， 114, 136-137. 
Stiitzle, Thomas Glinter, 125. 

Subadditive law, 59. 

Subcubes, 148. 

Subforests, 42. 

Subinterval constraints, 190. 

Submatrices, 106—109, 177- 
Subset sum problem, 268. 

Substitution, 257. 


Subsumption of clauses, 61, 96, 124, 152, 

155, 156, 166—168, 181 ， 269- 
implement at ion, 167, 259. 
on-the-fly, 124, 156. 

Subtraction, encoding of, 100. 

Sudoku, 183, 225- 
Summation by parts, 48- 
Summers, Jason Edward, 200- 
Sun, Nike ( 孙妮克 )， 51 • 

Support clauses, 99, 114, 171. 

Survey propagation, 51, 90-95, 165-166, 213. 
Swaminathan, Ramasubramanian (= Ram) 
Pattu (fjrrm3rUfjm(^uj(bw 
^(SiirrLSfBfT^w)^ 274* 

Swapping to the front, 211, 242. 

Sweep of a matrix ， 108—109 ， 177- 
Swoop of a matrix problem, 109. 

Syllogisms, 129. 

Symeater in Life, 200- 

Symmetric Boolean functions, 179, 207, 219, 
270; see also Cardinality constraints. 
5<i, see At-most-one constraint- 
Si, 6 , 220 . 

5>i, see At-least-one constraint. 

135, 179, 256 - 

Symmetric threshold functions ，see 
Cardinality constraints. 

Symmetrical clauses, 105—106, 156- 
Symmetrical solutions, 138, 183, 274. 
Symmetries of Boolean functions, 178. 
Symmetry breaking, vii, 5, 105-114, 138, 
176—181 ， 187, 188, 190-192, 238, 267 ， 
281-283, 285, 288-290- 
in graph coloring, 99-100, 114, 171, 

179, 187. 

Symmetry from asymmetry, 19, 201. 
Synthesis of Boolean functions, 137, 

178-179, 194. 

Szabo, Tibor Andras, 224. 

Szegedy, Mario, 90, 161 ， 255- 
Szeider, Stefan Hans, 224, 284. 

Szemeredi, Endre, 59. 

Szpankowski, Wojciech, 225. 

尤 - snakes, 53, 54 ， 149- 
T/x: teramems = trillions of memory 
accesses, 110, 121, 126, 265, 281- 
Tableaux, 275. 

Taga, Akiko ( 多賀明子 ) ， 264, 267. 

Tajima，Hiroshi ( 田 ft 宏史 ) ， 100. 

Tak, Peter van der, 75- 
Takaki，Kazuya ( 高木和哉 ) ， 224. 

Tamura，Naoyuki (由村直 乏 ) ， 100, 171 ， 

264, 267, 268. 

“Take account,” 37, 43, 45—46, 217, 235. 
Tanjo, Tomoya ( 丹生智也)， 268. 

TAOCP: The Art of Computer 

Programming, problem, 115, 169. 

Tape records, 32* 

Tardos, Gabor, 82, 224, 254. 
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Tarjan, Robert Endre ， 41 ， 42, 214, 217. 
Tarnished wires, 13, 193- 
Tat ami tilings, 115, 143. 

TAUT: The tautology problem, 3, 129, 130- 
Tautological clause (p), 3, 58, 60, 152, 

180, 215, 226-228, 258. 

Tensors, 151- 

Teramem (T/x): One trillion memory 
accesses ， 40, 106, 107, 110, 217, 

218, 286. 

Ternary clauses ， 3-6, 36, 118, 131 ， 183; 
see also 3SAT. 

Ternary numbers, 100, 141, 179. 

Ternary operations, 9, 136. 

Territory sets, 84, 161, 163- 
Test cases, 113-124. 

capsule summaries, 114-115. 

Test patterns, see Fault testing. 

Tetris, 84. 

Theobald, Gavin Alexander, 190- 
Theory and practice, 109. 

Three-coloring problems, see Flower snarks. 
Threshold functions, 100—101, 175. 
Threshold of satisfiability, 50—54, 91, 
148-149, 221. 

Threshold parameter ㊀, 126, 213, 286. 
Thurley, Marc, 262. 

Tie-breakers ， 74, 239- 

Tiling a floor ， 115, 138, 143, 199. 

Time stamps, see Stamping of data. 
Timeouts, 120. 

TIMP tables ， 36— 40, 43, 45, 144-145. 

To-do stack ， 259- 

Tomographically balanced matrices, 141. 
Tomography ， 24-26, 115, 141-143, 167, 285. 
Top-down algorithms ， 252. 

Topological sorting, 85, 248. 

Tomses ， 134, 138, 200- 
Touched clauses, 44. 

Touched variables, 259. 

Tovey, Craig Aaron, 150, 223. 

Tower of Babel solitaire, 282* 

Tower of London solitaire, 282. 

Trace of a matrix: The sum of its diagonal 
elements, 108, 218. 

Traces (generalized strings), 83—90, 

161—162, 252, 254. 

Tradeoffs, 125—126. 

Trail (a basic data structure for Algorithm 
C) ， 62— 65, 68, 72, 124, 166, 236, 238. 
reusing, 75. 

Training sets ， 15—16, 115, 125—127, 133 ， 

137, 182, 286- 

Transitions between states, 16—24, 

175, 202, 218. 

Transitive law, 56, 228. 

Tree-based lookahead, see Lookahead forest. 
Tree function, 230. 

Tree-ordered graphs, 163—164. 


Treelike resolution, 55-56, 152-153. 
Treengeling solver, 121. 

Triangle-free graphs, 167. 

Triangles (3-cliques), 167, 238, 264. 
Triangular grids, 136. 

TYibonacci numbers, 216- 

Triggers, 46, 126- 

Trivalent graphs, 147, 154, 231. 

Trivial clauses, 124-127, 156, 236, 239. 
Trivially satisfiable clauses, 3. 

Truemper, Klaus, 273. 

TYuszczynski, Miroslaw (= Mirek) 

Janusz, 216. 

Truth, degrees of, 37-39, 42-43, 45-46, 216- 
Truth tables, 129-130, 179, 194, 220, 277. 
Tseytin, Gregory Samuelovich (U^hthh, 
rpnropHH UaMynjTOBMH), 9, 59—60, 71, 
133, 152, 154, 168, 178, 215, 231 ， 290. 
encodings ， 9, 17, 101-102, 136, 173, 195. 
encodings, half of, 192, 268. 

Tsimelzon, Mark Boris, 134. 

Tuning of parameters, 124—128, 133, 182* 
Turan, Pal (= Paul) ， 190. 

Turton, William Harry, 180. 

Two-level circuit minimization, 257. 

UC k , 176, 273. 

UIP: Unique implication point ， 132, 233- 
Unary clauses, see Unit clauses- 
Unary representation (= order encoding), 
98—101 ， 114, 120, 170—173, 190 ， 

268, 281. 

Undoing, 28—31, 37—39, 95—96, 143—145 ， 

208, 212, 217-218. 

Uniform distribution, 159- 
Unique implication points, 132, 233. 
Uniquely satisfiable clauses, 48, 219. 

Unit clauses (= unary clauses), 3, 6, 9, 13, 
21, 23, 30, 31, 33, 35, 36, 66, 70, 130, 
144, 151 ， 157, 192, 205, 210, 238, 290. 
Unit conditioning, 27, 96, 166, 259, 261- 
Unit propagation ( 卜 i), 31-34, 36, 62, 65 ， 

68, 70—H ， 93, 97—99, 103—104, 132, 155, 
157, 165, 171 ， 174, 236, 270, 272, 276. 
generalized to 卜 / ^ 175. 

Universality of Life, 17. 

Unnecessary branches, 55, 227- 
Unsatisfiable core, 185. 

Unsatisfiable formulas, 1. 

implications of, 104, 175-176- 
Unsolvable problems, 130- 
Urns and balls, 221. 

Urquhart, Alisdair Ian Fenton, 231. 

VAL array, in Algorithm C, 66-68, 73-76, 
233-236, 238, 240- 
in Algorithm L, 37-39, 43, 216. 

Valid partial assignments, 165-166. 

Van de Graaff, Robert Jemison, 198. 
van der Tak, Peter, 75. 
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van der Waerden，Bartel Leendert, 4. 

numbers, 5, see VF(/co, •…， 一 i). 
van Deventer, Mattijs Oskar, 290. 

Van Gelder ， Allen ， 71 ， 233, 237, 263- 
van Maaren, Hans, 37, 46- 
van Rooij, Iris, 207. 
van Zwieten, Joris Edward, 37. 

VAR array, in Algorithm L, 38 ， 182, 211. 
Variability in performance on satisfiable 
problems, 35, 120-121 ， 128, 287. 
on unsatisfiable problems, 69, 121, 

128, 287- 

Variable elimination, 96-97, 101, 102, 

129, 154—155, 166-168, 173, 174 ， 
256-257, 259-260, 270, 272. 

Variable interaction graphs, 116—118, 182. 
Variables, 2. 

introducing new, 3, 6, 8, 9, 13, 60; 
see Auxiliary variables, Extended 
resolution. 

Variance ， 49, 158, 164, 240, 243. 

Vassilevska Williams，Virginia Panayotova 
(BacHjreBCKa ， BupraHH^i naHafioTOBa), 

167- 

Vaughan, Theresa Phillips, 162. 
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